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Summary. — In these three lectures a review is provided of the properties of galaxy 
systems as determined from optical and infrared measurements. Covered topics are: 
clusters identification, global cluster properties and their scaling relations, cluster 
internal structure and dynamics, and properties of cluster galaxy populations. 



1. Identification, global properties, and scaling relations 

I'l. Identification. - Historical identifications of clusters of nebultE date back to 
the late years of the XVIII century [1]. The first modern method of galaxy clusters 
identification and classification was implemented by Abell [2] in 1958. Abell worked out 
apparent overdensities of galaxies in the sky by eye inspection of photographic plates of 
the Palomar Observatory Sky Survey. He used the apparent magnitudes of galaxies to 
determine approximate cluster distances which he then used to convert apparent sizes to 
physical sizes. Abell characterize d clu sters by their richness, i.e. t he n umber of galaxies 



in the 2 mag range 7713 to + in a circle of radius 2.1 Mp af^l corrected for the 



(^) ma is the magnitude of the third brightest galaxy in the cluster field. 

(^) Ho — 70 km s~^ Mpc~^, fim ~ 0.3, Qa = 0.7 are adopted throughout these lectures. 
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contamination by fore- and back-ground galaxies. We now refer to this radius as the 
'Abcll radius'; it is st ill widely used as a typical cluster size since it is rather close to the 



virial radius, r9n rlf'^l of a massive cluster with a velocity dispersion cr„ ~ 900 km s ^. 

Abell's catalog contained 2712 clusters, later extended to the southern hemisphere [3] 
to a total of 4073 clusters. Thanks to Abell's early work, galaxy clusters started to 
be studied as a class and not only as individual objects. Abell's catalog has formed the 
basis of the largest galaxy cluster-specific spectroscopic survey completed so far, the ESO 
Nearby Abell Cluster Survey (EN ACS 

Abell's cluster sample suffers however from two main problems, incompleteness and 
contamination by projection effects 013], although these problems are less severe than it 
is usually stated [7]. Incompleteness and contamination are crucial issues, in particular 
for cosmological studies, like e.g. the determination of the cluster mass function and its 
redshift evolution (see Sect. 11'4) ). Ideally one would like to have a cluster sample with zero 
contamination and 100% completeness. Since this is not possible, it becomes essential 
that contamination and incompleteness can be precisely estimated, so that the statistical 
sample can be corrected for. A precise estimate of the contamination and incompleteness 
of a cluster catalog can be obtained by applying the cluster identification technique to 
mock galaxy samples extracted from cosmological numerical simulations. This of course 
requires the identification technique to be exactly reproducible, and Abell's eye is not. 

The main difference of today's clusters identification techniques with respect to Abell's 
is that they are both automated and objective. Automatization requires either that 
photographic plates be digitized, or that data are digital in origin, coming from CCD 
cameras [S]. Besides automatization and objectiveness, modern clusters identification 
techniques also have a well-understood selection function and impose minimal a priori 
constraints on the properties of the systems to be identified [9J . These characteristics are 
now common to many cluster identification methods which have been developed over the 
years. Among these, some have been specifically developed to be used on photometric 
galaxy samples and some on spectroscopic galaxy samples. 

Among the methods applicable to samples of galaxies without redshift information, 
the most used is the Matched Filter (MF hereafter [ID])- In summary, the method works 
as follows. The spatial and luminosity distribution of observed galaxies in a given field 
is modeled as the sum of two contributions, one from the field, another from the cluster 

(1) D{R,m) ^b{m) + NcN{R)<j){m). 

The term b{m) represents the background galaxy counts at a given magnitude, m. The 
cluster term is itself the product of three factors. N{R) is the projected radial profile of 
the cluster galaxies as a function of the projected radial distance from the cluster center, 
R, (f){m) is the differential cluster luminosity function (LF hereafter; see Sect. II "2]) . and 



(^) The virial radius r2oo is the radius within which the enclosed average mass density of a 
cluster is 200 times the critical density, 200 pc- 
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Nc is a measure of the cluster richness, or multiplicity. Both N{R) and (p{m) can depend 
on free parameters, typically a characteristic length scale and a characteristic magnitude. 
The best-fit free parameters are found through a Maximum Likelihood procedure aimed 
at minimizing the difference between the observed galaxy distribution, D{R,m), and 
the model. Clusters are identified by searching for local maxima within a moving box 
of given size centered on each pixel of the filtered galaxy map array (or at each galaxy 
position If the central pixel in the box is a local maximum, and if the maximum 

exceeds a given threshold (which depends on the background noise), a candidate cluster 
is registered. An estimate of the cluster redshift results from assuming a universal value 
for the absolute characteristic magnitude of the LF. Several variants of the MF method 
have been proposed [TTl [T^l US] • 

Among the methods applicable to samples of galaxies with redshift information, by far 
the most used is the friends-of- friends percolation algorithm (FoF hereafter, see [HlllSj ). 
This method links together all galaxies within a chosen linking volume centered on each 
galaxy. At variance with Abell's method, galaxy systems are identified within a physical 
overdensity, not within a physical (fixed) size. Since the density of galaxies in a flux- 
limited survey depends on redshift, the linking volume is also scaled with redshift. In 
practice, specifying the linking volume is equivalent to specifying two linking lengths, 
one in the plane of the sky, another along the redshift direction. Different works have 
adopted different linking lengths, and different scalings with the galaxy density (compare, 
e-g-, [H] to [12]). The linking lengths have been chosen using a priori knowledge of the 
physical characteristics of the galaxy systems one is looking for [17j . or by minimizing 
the differences between the recovered and intrinsic properties of systems identified in a 
mock galaxy sample |18) . 

While originally conceived to work on imaging surveys, in its modified versions the 
MF method can also be applied to spectroscopic surveys [TTl US] ■ Symmetrically, when 
the redshift information is not available, it is still possible to adopt the FoF method, using 
photometric, rather than spectroscopic, redshifts [T^]. Nevertheless, applications of the 
MF (respectively, FoF) method have so far mostly concerned data from photometric 
(respectively, spectroscopic) surveys. 

The MF algorithm has been applied to data from the ESO Imaging Survey [201 HI] > 
the Sloan Digital Sky Survey (SDSS [H]), the 2 Micron All Sky Survey (2MASS [22]), and 
several other surveys [231 [2^ [2^ I2S1 IT^ . Clusters in the resulting catalogs are detected 
out to z ~ 1 and beyond, and down to masses ~ IO^^'A/q, depending on the depth 
of the photometric survey. Spectroscopic follow-ups show the photometrically identified 
high-z cluster candidates to be real [27l [28]. Comparison to mock catalogs show that 
completeness can reach ~ 100% for ~ 10"'^^ Mq mass clusters out to intermediate- z, and 
~ 50% for very massive clusters (^ 10^^ Mq) out to z ~ 1.5 [Tni[I3]. In comparison to 
X-ray cluster surveys, these optical cluster surveys have been able to detect lower- mass 
clusters [241 129j and clusters with an X-ray luminosity below what expected given their 
mass (see, e.g., [30l |3T1|32] ) . Less than 10% of X-ray detected clusters with z < 0.5 are 
missed in these optical cluster surveys [55] . 

The FoF method has been applied to many spectroscopic surveys, e.g. the Center for 
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Astrophysics Redshift Survey [151 Ull IM] , the Southern Sky Redshift Survey [35] , the Las 
Campanas Redshift Survey [36], the ESO Shoe Project survey [S^, and, more recently, 
the SDSS, the Two Degree Field Galaxy Redshift Survey (2dFGRS), and the 2-Micron 
All Sky Redshift Survey (2MRS) [TTIIMI dSllSSl HOI HI] ■ The resufting catalogs Hst up to 
several thousands of galaxy systems. Most of them are small galaxy groups, with median 
velocity dispersion and mass ~ 200 km s"^, M ^ 10^^ Mq. Clearly, identification in 
3-d space (spatial coordinates and redshifts) allows to detect lower-mass systems than 
identification in projected spatial distribution only. On the other hand, the higher depth 
of photometric surveys allows the detection of (rather massive) clusters out to higher-z. 

Several other cluster identification methods have been implemented. Some methods 
start with a FoF identification, then provide a first estimate of the mass of the identified 
system, and use this estimate to determine which galaxies belong to the group, in an iter- 
ative way [m 1331 133] . These variants of the FoF method try to minimize contamination 
by galaxies that are close in redshift but not in real space (interlopers). This is crucial 
when the contamination risk is high, like in medium- or high-z samples. An application 
of this technique to the Canadian Network for Observational Cosmology (CNOC) 2"'' 
survey has produced a catalog of ^ 200 galaxy groups at a median z — 0.33 [42] . 

The Cluster Red Sequence (CRS hereafter [31]) method is based on the observation 
that all rich clusters, at all redshifts up to z ~ 1, have a more or less well defined red 
sequence of galaxies in a color-magnitude diagram, where the color is defined by two 
photometric bands bracketing the 4000 A break feature of galaxy spectra. Since the 
4000 A break is redshifted into different observational bands depending on the galaxy 
z, with a suitable set of filters it is possible to define color-cuts to select galaxies at 
a redshift close to the cluster mean redshift. By comparison with spectrophotometric 
models, the CRS method also provides estimates of the mean redshifts of the detected 
clusters, with an accuracy superior to that reached by the MF method 45J. Roughly 
speaking, the method consists in slicing a given galaxy catalog in color, computing the 
galaxy surface density of that slice, and identifying significant overdensities. An example 
of a cluster detected with this technique [46] at an estimated redshift z ~ 1 is shown in 

Fig.m 

The CRS method has been first tested on the CNOC-2 data-set, for which spec- 
troscopy is available, using only two-band photometry. In practice, this is a comparison 
of the photometric CRS vs. the spectroscopic FoF methods. The fraction of clusters 
detected with the CRS and undetected with the FoF method, i.e. the fraction of false 
positive CRS detections, is only 1/23 out to z = 0.5, and the CRS photometric z- 
estimates are accurate to Az = 0.03. Given that many of the identified systems are 
groups rather than rich clusters, the performance of the CRS method is remarkable. Ap- 
plication of this method to mock data-sets extracted from the Millennium simulation has 
shown that projection effects are of relatively minor importance in the great majority of 
the identified systems (80 to 90%, depending on z) [47 . 

The CRS method has been applied to photometric data obtained in two wavebands 
with large format mosaic cameras at CFHT and CTIO. The resulting catalog (the Red- 
sequence Cluster Survey, see [46l [48]) contains almost 1000 cluster candidates among 
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Fig. 1. - Color composite image of a cluster identified with the CRS technique [46]. The cluster 
is located at an estimated redshift of 0.952. Red-sequence galaxies are displayed with a reddish 
color in the figure, and red-sequence galaxy isodensity contours are displayed. The rectangle 
identify the BCG. The image size is 15' x 15', i.e. ~ 7 x 7 Mpc at the cluster estimated distance. 
Note how the galaxy colors are helpful in identifying the cluster against the background. 



which more than a hundred at z '--^ 1, some of which have been spectroscopically confirmed 

The CRS method has been appUed to infrared (IR hereafter) data obtained with the 
IRAC camera onboard Spitzer [50l [51] . At higher z the 4000 A break is progressively 
shifted towards the IR, hence deep IR observations are needed to detect cluster red- 
sequences at z > 1. Hundreds of candidate clusters out to an estimated redshift z = 1.85 
have been found so far by this method using data from Spitzer surveys, of which more 
than one hundred above z = 1 [501 HI] ■ Other spectral features than the 4000 A break 
can be used to define IR color cuts aimed at reducing the field galaxy contamination, 
e.g. the ~ 1.6 /im peak in the flux distribution of stellar populations 52J. Based on this 
spectral feature z > 1 clusters can be selected by using two Spitzer IRAC bands (3.6 and 
4.5 Ilia) which will still be in operation in the Spitzer postcryogenic era. Recently, data 
from the Japanese IR satellite AKARI have been used to detect candidate clusters at 
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0.9 < z < 1.7 by exploiting both the 4000 A break and the 1.6 peak features [53] . 

Another cluster identification method that relies upon the existence of a red sequence 
for cluster galaxies, is maxBCG [2T1[9]. The maxBCG method also relies on the location 
of the brigthtest cluster galaxy (BCG) near the cluster center. Application of this method 
to the SDSS data |54] has provided a catalog of 13,823 clusters with cr„ > 400 km s~^, 
out to z = 0.3, with photometric z estimates accurate to Az ~ 0.01. Comparison 
with a cluster catalog constructed by applying the MF method on the same data-set 
has shown that ~ 80% of the systems are identified by both the maxBCG and the MF 
method [H]. The imperfect matching may be due to the presence of substructures (see 
Sect. 12 5p identified as distinct clusters by the maxBCG method, and to the presence of 
false positives in both catalogs. Comparison with mock catalogs indicates the maxBCG 
cluster catalog is 90% pure and 85% complete for clusters with masses > 10^** Mq. 

Relying upon the existence of a red sequence for cluster galaxies has certainly proven 
to be a very effective way of selecting galaxy clusters. On the other hand, unrelaxed, 
low-mass galaxy clusters in which this sequence is not established yet, or at least not 
very prominent |24j . may be missed by CRS methods and alike. This is particularly true 
at high z, since the fraction of early- type galaxies (ETGs in the following) decreases with 
z (see Sect. 13 '1]) making the red sequence less and less prominent. For this reason, other 
cluster detection methods make use of multi-band photometry to allow a wider selection 
of galaxy spectral types, star-forming galaxies included. Typically, but not exclusively, 
this is done by defining photometric-z through the comparison of the galaxy spectral 
energy distributions with model templates. The Cut and Enhance (CE) [55] and the C4 
methods [5^ have been developed to make full use of the SDSS 4-bands photometry. 

At high z, IR photometry proves essential to identify galaxy cluster candidates. A 
clear demonstration of the potential of cluster searches conducted in the IR is the detec- 
tion of a z = 1.41 cluster [57. . This cluster has been identified in the Spitzer IRAC Shal- 
low survey as an overdensity of IR-selected galaxies (see also [SS] [5D]). Each galaxy was 
assigned a photometric-redshift probability distribution, which was then used to weigh 
the density maps within overlapping redshift slices. A wavelet technique was adopted to 
smooth the density field, in which significant peaks were then looked for. The z = 1.41 
cluster is only the highest-z confirmed detection among 335 galaxy cluster and group 
candidates (average mass ~ 10^* Mq) found with this technique in a 7.25 deg^ region 
in the Spitzer IRAC Shallow survey [SHI HO] ■ Over a hundred of the cluster candidates 
have a redshift above unity, with an estimated spurious detection rate of 10%. Twelve 
of the z > 1 candidates have already been spectroscopically confirmed. Unfortunately, 
the photometry is not deep enough to identify clusters much beyond z « 1.5 [59] . 

IR-color selection is also useful to improve the efficiency of spectroscopic follow-ups 
of high- z clusters identified in other wavebands, e.g. in X-rays [SUl [5T] . 

A rather different method for the identification of galaxy systems is the Voronoi 
Galaxy Cluster Finder (VGCF [62j). In this method, the projected space is divided in 
cells according to the Voronoi tesselation technique, each cell containing a single point 
(i.e. a galaxy; see Fig. (^J). The inverse of the cell area defines the local galaxy density. 
Clusters are defined as ensembles of adjacent cells with a density above a given threshold. 
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Fig. 2. - An example of Voronoi tesselation of a galaxy field. Each triangle represents a galaxy 
position on the plane of the sky [62) . 



The search for clusters is done in magnitude bins. The main advantages of this method 
is that it is nonparametric and as such it does not require a priori hypotheses on the 
cluster properties, such as cluster size, density profile, or shape. The VGCF method 
has been shown to be competitive with (or even better than) the MF method |63j. A 
comparison between the CE, MF, maxBCG, and VCGF methods has shown that at low 
redshifts (z < 0.4) the CE is more complete, but pays the price of a higher rate of false 
detections (~ 30%) as measured from Monte Carlo simulations '55]. 

If multicolor photometry is available, the VGCF method can be applied to subsamples 
selected in the color-magnitude diagram, thus reducing the field contamination by the 
same technique adopted in the CRS method [Bl]. This VGCF+CRS method has been 
used to detect clusters in fields previously observed by the Chandra X-ray satellite [65] , 
The optical detection fraction of X-ray-detected clusters was 46% vs. an X-ray detection 
fraction of optically-detected clusters of only 11%. While part of the optical detections 
may be spurious, the cluster detection threshold was clearly lower in the optical catalogs 
than in the X-ray catalogs. Galaxy-rich clusters without X-ray counterparts are also 
detected, suggesting the existence of a population of underluminous (perhaps not yet 
viriahzed) X-ray clusters (see, e.g., [3D1 [HTJ [35] ) . Sufficiently deep photometric data-sets 
allow the VGCF method to detect clusters as far as z ~ 1 and beyond \l9i . 
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With some modifications [66], the VGCF method has also been used on 3-d data-sets, 
such as the DEEP2 Galaxy Redshift Survey, resulting in a catalog of 105 galaxy systems 
with median z = 0.86, and median = 480 km s~^ [67] . 

Eventually, when the data are too shallow to rely on galaxy number counts, it is 
possible to identify cluster candidates as positive surface-brightness fluctuations (SBF) 
in the background sky [SSI [Mj ■ An application of the SBF method to a drift-scan survey 
has produced a catalog of ^ 1000 cluster candidates a,t z < 0.8, the Las Campanas 
Distant Cluster Survey (LCDCS [MllIQ]). At z < 0.3, the catalog contains systems with 
masses typical of galaxy groups. This catalog has formed the basis for the ESO Distant 
Cluster Survey (EDisCS [71]), in which 20 LCDCS candidate clusters have been followed 
up spectroscopically with VLT/F0RS2. 

Going to the next step of the cosmic hierarchy, catalogs of clusters can be used to 
define superclusters [75] . Superclusters have been identified either by the FoF percolation 
technique [731[711[751[73], or as overdensities in smoothed cluster-density fields [7711751175] . 
Large-scale structure morphology has also been characterized statistically, without the 
need of defining superclusters, e.g. by the Genus statistics [5D], or by the use of Minkowski 
functionals [81 and their combination (the Shapefinder statistic, see j82)). 

1'2. Global properties: richness, luminosity, mass. - In order for cluster catalogs to be 
useful in cosmological studies, they should also provide cluster mass estimates. Cluster 
masses can be estimated directly, by applying dynamical methods (virial theorem, Jeans 
equation, . . . ) to the sample of cluster galaxies, if redshifts are available; by solving 
the hydrostatic equation for the X-ray emitting intra-cluster plasma; or by analyzing 
the effects of gravitational lensing on background galaxies in the cluster field. All these 
methods are very expensive in terms of telescope time, hence it is customary to use 
cheaper global cluster quantities that can serve as mass proxies. In optical cluster studies, 
the most used mass (M hereafter) proxies are the cluster richness (or multiplicity), N, 
namely the number of galaxies contained in a cluster, and the cluster luminosity, L, 
both measured in a certain magnitude range and out to a certain radius from the cluster 
center. 

N and L are evaluated by counting galaxies and, respectively, summing their lumi- 
nosities, in a given region of space where we know the sample is uniformly complete down 
to a certain magnitude, rric. If the sample suffers from incompleteness, a correction must 
be applied. Both N and L must then be corrected by subtracting the expected contam- 
ination by field galaxies (which can be estimated in a comparison empty field or from 
the number counts of general field galaxy surveys, see, e.g., |83j). In comparing the N 
and L estimates of clusters at different distances, the limiting magnitudes used must be 
accordingly scaled. 

If one is looking for a mass proxy, the summed luminosity of the brightest galaxies 
could suffice. On the other hand, if what is needed is the total cluster luminosity, an ex- 
trapolation is required to account for the contribution of the faint galaxies with m > mc- 
Such an extrapolation is done by fitting a suitable function to the observed magnitude 
distribution, and then integrating this function from rUc to the magnitude of the faintest 
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galaxies. By far, the most widely used function is the Schechter LF [84] 

(2) (f>il)dl = (l/hr expi-l/L) d{l/L), 

where I is the galaxy luminosity, l^, and (pi, are the characteristic luminosity and number 
density, respectively, and a is the faint-end power-law exponent. Galaxy luminosities are 
derived by converting galaxy apparent magnitudes into absolute magnitudes via knowl- 
edge of the cluster luminosity distance and of the Galactic extinction in the observational 
photometric band (see, e.g., [85]). In order to get an estimate of the total cluster lu- 
minosity within a given radius, e.g. the overdensity radius r2oo, one must determine 
the luminosity density profile, Abel-invert it to obtain the 3-d profile (see eq. (fTU)l in 
Sect. [2T]) . and finally extrapolate it to the desired radius with a suitable fitting function 
(e.g. a projected 'NFW profile [571 see also Sect. [2]). 

By the same technique it is in principle possible to get an estimate of iV(< r2oo)- 
However, extrapolation to fainter magnitudes is in this case dangerous, since faint galax- 
ies largely outnumber bright galaxies, while their integrated contribution to the total 
luminosity is only marginal. A more robust estimate of a cluster richness is provided by 
the Bgc parameter [89^, the galaxy cluster center correlation amplitude. It is measured by 
counting galaxies in a fixed aperture around the cluster center, it requires an assumption 
about the shape of the correlation function and the LF, and it must be corrected for the 
field galaxy contamination. Bgc is almost independent of the fixed aperture and chosen 
limiting magnitude |90j . 

The most classical direct method to determine M, the cluster mass, is by applying 
the virial theorem to the projected phase-space distribution of cluster galaxies (e.g. [86| ) . 
This method has been in use since the '30s, when Zwicky and Smith [91,,92j.93j provided 
the first preliminary mass estimates of the Coma and Virgo clusters. Their studies marked 
the discovery of dark matter (DM hereafter; see, e.g. [I]). The virial theorem is obtained 
by integrating the equation of hydrostatic equilibrium for the galaxy distribution in the 
potential well of a cluster (the Jeans equation, see |HS])- In terms of the observables the 
virial theorem can be expressed as follows, 

(3) M = 3TTPalRh/G, 

where G is the gravitational constant, is the line-of-sight velocity dispersion of cluster 
galaxies, and Rh is the harmonic mean radius of the projected spatial distribution of 



cluster galaxieaf ) 



(4) i?,-^ 



2 \^ 

2-1 i>j ^ij 



(*) Note that it is customary to use another quantity, usually called the "virial radius" (see e.g. 
[96] 1. that equals twice the harmonic mean radius. However, the radius at a given overdensity, 
r200 or rioo, is also referred to as the "virial radius". In order to avoid confusion I prefer to use 
here the harmonic mean radius. 
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where Rij is the projected distance between two cluster galaxies, and n is the number of 
cluster galaxies. The factor Stt corrects for projection effects |941l96j. The factor P is the 
surface pressure term |951 196] . It is a correcting factor needed when the entire cluster is 
not included in the observed sample. It can be understood as follows. Suppose you have 
a galaxy orbiting a cluster with its apocenter at radius a and you observe the cluster only 
out to the radius r/, with r; < a. Making the wrong assumption that the entire cluster 
has been observed corresponds to imposing a smaller apocenter to the galaxy, since a 
cannot be larger than r; under this assumption. Given the galaxy velocity, the same 
orbital anisotropy, and the same cluster density profile, imposing a smaller value for the 
galaxy orbital apocenter corresponds to forcing a larger mass inside r;. The correction 
factor can be evaluated as follows: 

(5) P=l-4nrf ^^"'^ ""^'^^ 



where r; is the limiting observational radius, p(r) is the tracer mass density distribution, 
ar is the radial component of the velocity dispersion, and a{< ri) is the integrated velocity 
dispersion within r/ IJSJ. Knowledge of p(r) and of the galaxy velocity anisotropy profile 
is formally needed in order to solve for P; in practice one uses theoretical prejudice 
in order to make reasonable assumptions for these profiles. The resulting P estimate 
is therefore only an approximation, but this is preferable to no correction at all, since 
the correction factor is systematic, P < 1 (typically, P ~ 0.8-0.9 for a typical cluster 
observed out to r; ~ 2 Mpc [97]). 

Since one is using the velocity and spatial distribution of cluster galaxies in eqs. (I3l4p . 
the resulting M estimate will only be correct if the galaxies are distributed like the 
mass [95l |98]. Given that different galaxy populations are distributed differently in 
projected phase-space (see, e.g., [H]), the choice of the tracer may change the M estimate 
rather substantially. Analysis of the mass profiles of galaxy clusters have shown that the 
distribution of ETGs and red-sequence galaxies is similar to that of the cluster mass (see 
Sect. I2'2( ). hence they are to be preferred over late- type, star- forming (blue) galaxies as 
tracers of the potential when using the virial theorem to determine the cluster mass. 
Virial masses estimated using late-type (emission-line) galaxies can be 50% higher than 
those derived from ETGs [lOOj. 

How reliable are the cluster masses derived by applying the virial theorem to cluster 
galaxies? An analysis of clusters extracted from cosmological numerical simulations has 
shown that their masses are only 10% overestimated by application of the virial theorem 
to samples of n > 60 particles (galaxies) . Part of the overestimation is caused by the 
presence of interlopers, part by the presence of subclustering. Selecting only ETGs or 
only relaxed clusters reduces the bias in the mass estimate. Since much of the problem 
lies in the estimate of Rh, an alternative mass estimate based on ay only can be a viable 
alternative to the virial mass estimate |1011 Wl\ . 

Recent analyses of medium-distant clusters (CNOC, EDisCS) have shown that virial 
mass estimates based on cluster galaxies are entirely consistent with mass estimates 
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based on gravitational lensing and/or on intra-cluster gas X-ray emission |102[|103] . thus 
confirming the consistency found on local cluster samples [96l I104j . Other analyses have 
found that virial mass estimates are generally higher than the mass estimates based on X- 
ray emission |105l 11061 11071 11081 11091 IllOj . In general, the largest discrepancies between 
different mass estimates are found for dynamically unrelaxed clusters |108|I110|I11HI112] . 

An indirect confirmation of the reliability of virial mass estimates has come from the 
comparison of the values of cosmological parameters obtained using these mass estimates, 
with the values obtained by the Wilkinson Microwave Anisotropy Probe (WMAP [1091 

1'3. Scaling relations. - For a M-proxy to be effective, it is important to know pre- 
cisely its scaling relation with the cluster mass, as well as the intrinsic scatter in this 
relation (see, e.g. |114j ). The smaller this scatter, the better is the mass-proxy relation 
constrained. Moreover, it is important to know how scaling relations evolve with red- 
shift. In fact, scaling relations are generally determined locally, on the best observational 
samples, yet they have to be applied over a wide redshift range in order to improve the 
constraints on cosmological parameters. 

On the cluster scale, both N and L have been shown to be M estimators of similar 
or even better accuracy than Lx [SSI [Ml 11151 [50] . The typical accuracy with which 
a cluster M (within a given overdensity) can be predicted by N or L is ~ 30 — 40% 
[m nm M [Ml [Ml [nni- At least half of the observed scatter in the M- N 

and M — L relations appears to be intrinsic [90]. The scatter appears to increase at 
lower masses, although it is unclear how much of this increase is due to observational 
uncertainties and how much it is due to an intrinsic larger variance of the properties of 
low-mass galaxy systems [120] . The scatter decreases when irregular and substructured 
clusters are removed from the sample jl211 [5S] . 

The M — N and M — L relations are almost linear on the cluster scale, but not 
quite so, and most studies (with some notable exceptions [2T1[22]) indicate a mild [1221 
Tm nm nm nm ITM [Sl [Ml M or very mlld [Uni M [US] increase of the M/L 
and M/N ratio with cluster mass, M/L oc M" with a ~ 0.2 ± 0.1. The M/L ratio is 
not however well described by a simple power-law from the cluster to the group mass 
scale. It steepens considerably for masses below ~ 3 x 10^'^ Mq reaching a minimum at 
- 10^2 [ngj [T^Tl [UHl [US] (see Fig. [3). The non-linearity of the M - L relation at 
low masses is however of little concern for most cosmological studies based on cluster 
number density, since the completeness required by these studies is generally achievable 
only at the high-M end, at least at high-z. 

The M/L variation with M in galaxy systems is well fit by the theoretical predic- 
tions of semi-analytical models (e.g. [130] : see Fig. ^ in which the efficiency of galaxy 
formation is inhibited by the reheating of cool gas on small-mass scales, and by the long 
cooling times of hot gas on large-mass scales (see, e.g. 127J.) Other models that could 
produce the observed M/L variation require galaxy merging and/or destruction to occur 
with different efRciency in galaxy systems of different mass. These mechanisms seem 
however to be ruled out by the similar shape of the LFs of clusters with different masses 
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Fig. 3. ~ Observed M/L ratio for clusters and groups (dots) and their median values (circles) 
with 90% confidence levels error bars, are compared with theoretical predictions from two semi- 
analytical models (upper panel: |130| . lower panel: |131| V From [116] . 



[126j , and by the lack of any significant evolution of the cluster N — M relation up to 
z^l p^[m H51[TM|. 

In order to understand the possible biases involved in converting a given proxy into 
an M estimate, it is useful to compare scaling relations involving different proxies, or, 
equivalently, to determine the relations between them. In particular, it is interesting to 
compare optical and X-ray properties of galaxy clusters. 

In the comparison of X-ray and optical properties of galaxy clusters, two anomalies 
emerge: (i) there is a population of optically selected clusters with too low an X-ray 
luminosity, Lx, for their cr„ or optical richness (see Fig.HJ [5U1I5T1 [1^51 11361 11371 15^ ). 
and (ii) there is a population of X-ray bright galaxy groups with too small a ay for their 
X-ray temperature or luminosity |138l 11391 1140] . 

The X-ray underluminous clusters appear anomalous only with respect to the Lx — N 
relation established for X-ray-selected cluster samples. As a matter of fact, low-mass X- 
ray selected clusters do follow the same scaling relations established for clusters of higher 
masses |141j . But X-ray selection excludes low-L^, high-A^ clusters from the sample, 
leading to an Lx — N relation that is biased high relative to the true underlying relation 
[24l I142[ I143j . The true Lx — N relation is characterized by a large dispersion of the 
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Fig. 4. - The relation between X-ray luminosity and richness for a sample of nearby optically- 
identified clusters. Filled circles are detections, open symbols are upper limits in X-ray luminos- 
ity (i.e. optically identified clusters undetected in X-ray). The solid (dotted) line is the relation 
that fits the data excluding (including) the non-detections. From [145j . 



Lx values at given N. This can be partly explained by systematic errors in both the 
X-ray and optical estimates (see, e.g., |144| ). However, part of the dispersion may be 
intrinsic and related to different properties of the X-ray underluminous clusters relative 
to the normal X-ray emitting cluster population. X-ray underluminous clusters appear 
in fact more irregular |145| . and the properties of their galaxies are reminiscent of those 
of infalling field galaxies [3^. In other words, the X-ray underluminous clusters look like 
unvirialized systems. 

While X-ray underluminous clusters may be systems at an early stage of their dy- 
namical evolution, X-ray bright groups with abnormally small may be systems at an 
advanced stage of their dynamical evolution. A possible way to reduce the group (jy is 
to slow down galaxies by the process of dynamical friction |146j . Although the charac- 
teristic time of this process is generally longer than a Hubble time for cluster galaxies, 
it is shorter than this for massive group galaxies [140] . Another possibility is that tidal 
interactions transfer part of the orbital kinetic energy of group galaxies to their internal 
energy [140] . It cannot however be excluded that the intrinsic group are underes- 
timated because of projection, if these groups have a flattened distribution of galaxies 
with an anisotropic velocity dispersion tensor [140] . 

In order to remove possible biases in the selection of galaxy clusters, and therefore in 
the determination of cosmological parameters, we need to better understand the nature 
of these outliers from the relations between optical and X-ray properties. 
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1'4. Constraints on cosmological parameters and future surveys. - A traditional way 
of constraining is by determining the mean luminosity density, pL, and the mean 
mass-to-light ratio of the universe, < M/L >, via Vim =< M/L > xpi^/pc, where pc 
is the universe critical density. This is known as the Oort technique. This technique 
works if the systems used to measure < AI/L > are representative of the universe as a 
whole. M/L is an increasing function of M at the galaxy and group scales and flattens 
at the cluster scales (see Sect. II 3p . Hence, the average < AI/L > of rich, massive 
clusters should be representative of the universal value. This is also expected from the 
fact that clusters are assembled from regions > 10 Mpc across, so they should contain 
a sufficiently large collapsed volume to provide a representative sample of the average 
M/L of the universe [5D]. Note however that if the galaxy correlation function on these 
scales is not unbiased with respect to total matter, the value for one obtains from 
the average M/L of clusters via the Oort method does also depend on o-g, the amplitude 
of mass fluctuations on 8 h^^ Mpc scales |147| . 

The first application of Oort's technique probably dates back to 1965 when Abell 
|148j was able to constrain Qm in the interval 0.1 — 1.0 (using number- rather than 
luminosity-density). The constraints based on this technique tightened significantly in 
the 1990's, yielding flm — 0.2 ± 0.1 and therefore indicating a low-density universe with 
high statistical significance [TM fTCTl [TSTl [T5^ . 

Another way of constraining the cosmological parameters is by estimating the number 
density of galaxy clusters (and superclusters), as a function of their mass, n{M). Con- 
straints on the combination of the and cts parameters can be obtained by comparing 
the observed n{M) with theoretical predictions (e.g. |1531 11541 1155| ). The study of the 
evolution of n{M) with z, n{M, z), can provide even stronger constraints on cosmological 
models, allowing to break the l^m-crs degeneracy |1561 1157j . 

Recent analyses of optically-selected cluster samples, some based on the Oort tech- 
nique, some based on n{M, z), provide estimates for Qm and erg in agreement with the val- 
ues obtained by WMAP 5-yrs [US] (see Fig. [51 [TT71 1551 IT^ [T^ [TIM [TTfl [1551 [Ml liSj 

Superclusters can be used to constrain the evolution of cosmic structure in a more 
linear regime than applicable to galaxy clusters. It has been found that the Millennium 
simulation lacks very rich superclusters compared to the real universe |78j . Similarly, the 
existence of a very massive and compact supercluster recently detected at z ~ 0.9 [134] 
is a rather unlikely event to be expected a priori in the currently favored cosmology. 

While these results are encouraging, they are not yet competitive with those obtained 
with X-ray cluster surveys (see, e.g., [160) ). However, X-ray cluster selection, as well as X- 
ray cluster mass estimates may suffer from their own systematics (see, e.g., [1611 1162) ). 
Moreover, at z > 1 X-ray selection of clusters does not seem to be as efficient as IR 
selection [5ni[50|, and large samples of optical/IR- selected clusters are expected to come 
from ongoing and future very large {'^ thousands of square-degrees) optical/I R su rveys. 
Among these, the currently ongoing Red Sequence Cluster Survey 2, RCS-^^ is the 



{'') http://www.rcs2.org/ 
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Fig. 5. - Recent estimates of Qrn and as obtained via tfie Oort metfiod or via tfie determination 
of n{M, z) for optically-selected cluster samples. The shaded regions represent the WMAP 5-yrs 
allowed range for the same parameters [158] . References for the plotted points are: B03= [TT7] . 
L03=[85], A04=[109J, Eke=ll28] (upper panel) or [159] (lower panel), R04=tl24], T05=[l47]. 
M07=[5U], G07=[1H]. 1-cr error bars (as given by the authors) are displayed. Note that, when 
needed, fim has been estimated given erg from [158| as a prior, and vice versa. 



largest systematic search for galaxy clusters ever undertaken. It is based on the deg^ 
MegaCam imager on the CFHT, it will image ~ 1000 deg^ down to g' = 25.3, r' = 
24.8, z' = 22.5, and will detect clusters of galaxies up to 2; '--^ 1 using the CRS technique. 
It is estimated that upon completion, the survey will provide a sample of several thousand 
clusters. With such a sample it should be possible to constrain Q.^ to an accuracy of 
±0.02 and cts to an accuracy of ±0.05, and to estimate the equation of state of dark 
energy, w, to within 10% [163j . 

Other very ambitious surveys, most of them aimed at the characterization of the 
equation of state of dark energy and its evolution, are currently planned. They will prove 
very useful for distant cluster searches. A four-band survey is planned at the Panoramic 
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Survey Telescope And Rapid Response System, Pan-STARSSf^l being developed at the 
University of Hawaii's Institute of Astron omy . It will cover 1200 deg^ down to g = 27 in 
four bands. The Kilo-Degree Survey, KiD af^l is a 1500 deg^ public imaging survey in the 
five SDSS bands that will use the OmegaCAM instrument at the VLT Survey Telescope 
to go 2 magnitudes deeper than SDSS. It will be complemented by a near-IR survey, the 
VISTA Kilo-degree INfrared Galaxy survey, VIKING. The Dark Energy Survey, DE^ 



will cover 5000 deg^ in 5 bands {g,r,i, z,Y) with a wide-field camera to be installed at 
the Blanco 4 m telescope at CTIO [164]. 

Not only imaging, also spectroscopic surveys are planned, from which catalogs of 
clusters of ga laxi es will probably be extracted. The Baryon Oscillation Spectroscopic 
Survey, BOS Sf^l will observe 10,000 deg^ and obtain redshifts for 1,5 million red lumi- 
nous galaxies to z = 0.7, using the SDSS 2.5m telescope and spectrographs. BOSS is part 
of the SDSS-III and should provide the first data-release in July 2011. Another planned 
spectroscopic survey (1 million gal axies in 100 nights) is the Hobby-Eberly Telescope 
Dark Energy Experiment, HETDEJ^lT^ 



From space, a significant increase in the number of z > 1 clusters can come from 
a mid-I R su rvey to be conducted with the Spitzer Space Telescope during its warm 
mission expected to last for about 2 years (see, e.g., [55]). In the longer term, an 



unprecedente d am ount of data may be provided by two proposed space -based missi ons 



the mer ging of two proposed missions, DUNE jf^'^)! and SPACEK. 



ESA's EuchOCi 

(see, e.g., [US]), and JDEIvli^fj the Joint Dark Energy Mission of NASA and the U.S. 
Department of Energy. Euclid could provide « 1.5 x 10^ galaxy redshifts down to Hab = 
22, and a high-resolution, 3-filters, 20,000 deg^ photometric survey of > 2 x 10^ galaxies 
out to z ^ 2. 



2. Structure and dynamics 

Early determinations of cluster masses (e.g. [HIllMlinS]) from application of the virial 
theorem to cluster galaxy distributions, implicitly assumed that galaxies are fair tracers 
of the cluster gravitational potential, the so-called light traces mass hypothesis. However, 
the result of these analyses did not provide support for this assumption, since the derived 
masses were orders of magnitude larger than the sum of the masses of the visible galaxies. 
What if galaxies were not distributed like the total mass? Virial mass estimates could be 



(®) http : //pan-Starrs . if a.hawaii . edu/public/home .html 
C) http : //www. astro-wise . org/projects/KIDS/ 
C) https:// www . darkenergysurvey . org/ 
{^) http : //www. sdss3 . org/ cosmology .php 
(1°) http://hetdex 

(^^) http : / / ssc . spitzer . caltech. edu 

( ) http : / / sci . esa. int/ science-e/www/area/ index . cfm?f areaid=102 

(^'') http : //www. dune-mission.net/ 

(^'') http : //urania.bo . astro . it/cimatti/space/ 

(^"') http : //universe .nasa. gov/progr am/probes/ jdem. html 
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biased high or low [95l [98] . Comparison with other cluster mass estimates [96l 1104) , and 
analyses of simulated clusters extracted from cosmological simulations have since 
proven that the virial mass estimates are more or less correct. This then suggests that 
the projected spatial and velocity distribution of cluster galaxies is not very different from 
that of the total mass. However, proving this is not so simple, one must compare the 
distribution of the tracer population to the distribution of the total mass (see Sect. I2'3| . 

Knowing the mass distribution within clusters (also in relation to the distribution of 
the different cluster components) not only is important for a correct estimate of cluster 
masses, but also because it provides important clues on the formation and evolution of 
galaxy clusters and their components (e.g. [166[ 11671 1168j ). and on the nature of DM 
(e.g. [1691 11701 1171] ). E.g. warm DM is expected to produce lower density halo cores 
than cold DM (CDM). If DM is cold, halos should be characterized by density profiles 
with a central cusp, such as the NFW profile |172[ . 

ra\ Po 
(b) Pnfw 



(cr/r2oo)(l + cr/r2ooy 



where c is the concentration parameter. Modifications of the NFW profile have been 
suggested, all characterized by the central cusp [1731 11741 1175j . Another widely used 
cuspy density profile is the Hernquist model [176j . 

(7\ n Po 

\ ' ) PHernquist 



r (r + ThY 



On the other hand, observations of galaxy rotation curves have revealed the presence 
of a central core (e.g. [1771 11781 11791 11801 1181j ). Cores could be created in the mass 
distribution if the DM particles are self-interacting [169) , or if galaxies are able to pump 
energy into the DM component via dynamical friction [1461 1182) . Cored mass density 
models have been suggested [1831 1184] , such as the Burkert profile, 



(8) PBurkert — 



Pa 



(l + r/r,)[l + (r/r,)2 



characterized by the core radius Tc- Another widely used mass profile is the softened 
isothermal sphere, 

fc\\ Po 
(9) Psis = 



1 + (r/rc)2 



Note that at large radii p Burkert oc r^^ like the NFW profile, while psis c< r~'^ . 

Given the problems that the NFW model has at small scales, it is important to test 
its validity on larger scales, i.e. on cluster- and group-sized halos. Hence determination 
of the cluster mass profile, M{r), becomes a crucial test for the CDM cosmological model 
(see Sect. [2^. 
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2'1. Dynamical analysis: methods. ~ The most commonly used method to determine 
the mass profile M (r) of galaxy clusters is the Jeans method (see [86l) hereafter described. 

Assuming spherical symmetry, the projected number density profile N{R) of a tracer 
can be uniquely deprojected via the Abel inversion equation |86j . 



(10) 



j/(r) 



dN dR 
'dR\/W~^' 



where v{r) is the 3-d number density profile, R and r are the projected and, respectively, 
the 3-d radius (i.e. the distance from the cluster center). While this deprojection is 
straightforward, deprojecting the line-of-sight velocity dispersion profile, a^^R), requires 
knowledge of the velocity anisotropy profile, 



(11) 



/3(r) = 1 - 



<vt>(r) 
<v2>(r) ■ 



where <v^>(r), <v^>(r) are the mean squared radial and tangential velocity compo- 
nents, which we can write as cr^ (r) and (r) respectively, in the absence of bulk motions 
and net rotation. In the simplest case of the isotropic velocity distribution, j3{r) = 0, 
the deprojection reads 



(12) 



al{r) 



d[N<jl] 



dR 



dR 



Through a more complicated set of equations it is possible to deproject ay in the case of 
generic /3 |185j . 

Given arir) and (3{r) it is possible to determine the mass profile, M(r), through the 
Jeans equation for a coUisionless system of particles (e.g. galaxies). 



(13) 
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Similarly, given M{r) and (3{r) it is possible to determine the observed, projected phase- 
space distribution of the tracers via 



(14) 



z/(r)a^(r) = G 
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From the eqs. above it is clear that the same observed number density and velocity 
dispersion profiles N{R) and <Ty{R) can be obtained by a different combination of the 
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mass and anisotropy profiles M(r) and (3(r), and vice versa. Symmetrically, given the 
observables N{R) and (Ty{R) it is possible to obtain M(r) only if P{r) is known and 
(3{r) only if M{r) is known [185l [THS [Wl [Hg . This is the so-called "mass-anisotropy" 
degeneracy. 

In order to break this degeneracy, one must constrain (3{r) independently from M{r). 
A possibility is to build distribution function models (see, e.g., [189] ) and use them to 
compute the probability that a particle observed at a given projected radius R have a 
line-of-sight velocity w in a given interval dv. These probabilities are then used in a 
maximum likelihood analysis to determine the model that best represents the observed 
projected phase-space distribution of galaxies. The best-fit model can also be chosen 
by comparing the line-of-sight velocity distribution predicted by the model with the 
observed histogram of velocities of cluster galaxies. The comparison in this case can be 
done by considering moments of the velocity distribution higher than the second (e.g. 
[Ml 11891 11901 I191j ). Robust estimates of these moments are obtained by the use of 
Gauss-Hermite polynomials. 

Another way to break the mass-anisotropy degeneracy is to consider several indepen- 
dent tracers of the gravitational potential, then apply the Jeans procedure independently 
for each of the tracers. Subject to the constraint that different tracers should provide 
identical M(r) solutions, it is possible to reduce the range of acceptable [M{r), (3{r)] (see, 

e.g., mi)- 

The Jeans procedure outlined above also assumes that the system is in dynamical 
equilibrium. However, since clusters grow by accretion of field galaxies (e.g. [1931 llOOl l 
they are not steady-state systems. One should then include the time derivative term in 
the Jeans equation (eq. 4-29c in [86]). Fortunately, the rate of mass accretion onto low-z 
clusters is small, ~ 8% of the total mass in a dynamical time (^ 1/10 the Hubble time) 
|194j , although it increases with redshift [195j . Since the accretion process is not smooth, 
some clusters, even at low-z, may be observed during an intense accretion phase. These 
clusters are nevertheless easy to spot, since most of the accreted mass is in the form 
of groups [196j which can be identified as substructures in the projected phase-space 
distribution of cluster galaxies (see Sect. I2"5p . More problematic is the case of galaxy 
groups [197] and of high-z galaxy clusters, since several of these systems are likely to be 
detected when they are still in their collapse phase and far from dynamical equilibrium. 

Other usual assumptions of the Jeans analysis are sphericity and the absence of net 
rotation. Deviation from spherical symmetry has been shown not to be a major problem 
for individual clusters [1891 1198| and there is little if any evidence for net rotation in 
galaxy clusters [199| . 

The Jeans equation applies to a coUisionless system of particles. Galaxies do behave 
as quasi-collisionless particles when they move at high speed, which is the case in galaxy 
clusters. High-speed galaxy encounters produce little tidal damage and do not lead to 
mergers [200] . and only the most massive galaxies have their motions slowed down by 
dynamical friction [20 Ij . As the mass of the host system of galaxies decreases, dissipative 
processes become more important. Groups in particular, are very favorable sites for 
galaxy mergers [139] so the coUisionless Jeans equation may not be applicable for these 
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systems [202] , 

Interlopers are another serious problem when one is trying to determine a cluster mass 
profile. Interlopers are foreground/background galaxies that happen to lie in the same 
projected phase-space region occupied by cluster galaxies. Several methods exist to get 
rid of them. Tests on cluster-scale halos extracted from cosmological simulations have 
shown these methods to perform relatively well (see in particular [97 t I203|, I204j ). There 
are however interlopers that are impossible to distinguish from real cluster galaxies; in 
order to deal with these interlopers a statistical approach is generally adopted (e.g. [203]). 
In the statistical approach the projected phase-space distribution of galaxies observed in 
the cluster region is assumed to be contaminated by a certain fraction of interlopers with 
a well known spatial and velocity distribution (inferred from the analysis of simulated 
halos). Alternatively, one can use galaxy internal properties, such as, e.g., colors, spectral 
types, and morphologies, to improve the separation between cluster and field galaxies. 

In order to determine a reliable cluster mass from the projected phase-space dis- 
tribution, > 60 galaxies are needed [97], but about an order of magnitude more are 
required for the determination of a cluster mass profile (see, e.g., P05j ) . Spectroscopic 
samples of several hundred member galaxies per cluster are still rare, hence it is common 
practice to build a 'composite' cluster by stacking together the data of several clusters 
(see, e.g., [HUITMllinilinZlIinHl)- The projected phase-space distributions of different 
clusters c an b e put together by scaling galaxy radii and velocities with virial quantities 
(^200 J ^^2oa{lj) ■ This procedure is supported by the results of cosmological numerical 
simulations, that suggest that halos at the cluster mass scale are a quasi-homologous 
family of objects, their mass profiles changing only slightly with the halo mass [17211209] . 

By stacking clusters together it is possible to deal with samples of a few thousand clus- 
ter galaxies. These samples are obtained from cluster-dedicated spectroscopic surveys, 
such as EN ACS [HIS] and the CNOC [21011211] . and also from field spectroscopic surveys 
wherein clusters have been identified, such as the 2dFGRS [2121 1213] and the SDSS (see 
e.g. [5511214] ). The most recent cluster-dedicated spectroscopic surveys are the Las Cam- 
panas/ Anglo Australian Telescope Rich Cluster Survey (LARCS [215] ). the Cluster and 
Infall Region Nearby Survey (CAIRNS [216]), the Wide Field Nearby Galaxy-clusters 
Survey (WINGS [313), and the EDisCS [TJ. 

In recent years, another technique, usually referred to as the Caustic method ( [2181 
1219] ) has been developed. In this method, a cluster M(r) is obtained from the amplitude 
of the caustics delineated by the projected phase-space distribution of galaxies in the 
cluster region. This amplitude is related to the gravitational potential through a function 
of the projected radius, R, of the gravitational potential, and of /3(r) (see eqs. 9 and 
10 in [219] ). Also this method suffers from the mass-anisotropy degeneracy, but only 
in the central cluster regions, since numerical simulations indicate J- ~ const at large 
radii, R > r2oo (see Fig. 2 in 219J). Hence the Caustic mass estimate is relatively robust 



(^®) The so-called circular velocity W2oo is defined as «2oo ~ (C?M2oo/''20o)^''^ where M200 = 
(47r/3) 200pc rioo. 
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Fig. 6. - The DM circular velocity profile (T4 = i/GM/r) of the stacked cluster from the ENACS 
data-set |227| (points with 1-cr random + systematic error bars). The best-fitting NFW and 
Burkert models are indicated with a blue and red line, respectively. 



at large radii, exactly where the Jeans estimate may be more affected by problems of 
interlopers and deviations from dynamical equilibrium. On the other hand, the Jeans 
method is more robust at small radii, where imperfect knowledge of J- increases the 
systematic uncertainty in the Caustic mass estimate. 

The Caustic and Jeans methods have been shown to produce consistent cluster mass 
profiles [2061 1216j . Through analyses based on samples of cluster-sized halos extracted 
from cosmological numerical simulations, both methods have been shown to be reliable 
[2191 11981 1205j . There are only a few direct comparisons of mass profiles determined 
from the distribution of cluster galaxies with those determined using the X-ray-emitting 
intra-cluster gas or via gravitational lensing. In general, lensing mass profiles are in 
agreement with those determined via the Caustic [220] and Jeans [22lj methods. On 
the other hand, the agreement is less good between X-ray determined mass profiles and 
those determined from the distribution of cluster galaxies [2201 1222] . 

2'2. Moss profiles. - Application of the Caustic technique to the CAIRNS sample 
|223l 12241 12251 12161 1124] has shown that the cluster mass density profile p{r) resembles 
the NFW model, except at large radii, r > 2r2oo, where its slope seems to be somewhat 
steeper (between —3 and —4). SIS models are rejected. The best-fit values of the NFW 
concentration parameter range between 5 and 17. Very similar results have recently 
been obtained by application of the same Caustic technique to a new sample of 72 X-ray 
selected clusters extracted from the SDSS (the CIRS sample [214] ). 

A combination of the Jeans and Caustic analysis was used to determine the mass 
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profile of a composite cluster extracted from the 2dFGRS |206| . By stacking together 43 
nearby clusters, a total sample of 1345 cluster galaxies was obtained. Late-type galaxies 
(LTGs in the following) were excluded from the sample, and isotropy was assumed for 
the Jeans analysis. The resulting M(r) was found to be well described by a c ~ 6 NFW 
profile over the radial range 0-2 r'2oo- If a cored profile is fitted to M(r), the core radius 
is constrained to be small, < 0.1r2oo, i-e. not much larger than the size of the BCG 
which generally sits at the cluster center. 

A composite of 1129 ETGs was constructed from 59 clusters from the ENACS sample 
[207j . By comparing the velocity distribution of cluster ETGs with distribution function 
models, stringent constraints on (3 were obtained. The ETGs were shown to move on 
nearly isotropic orbits, hence f3 = was adopted in the Jeans analysis. It was found that 
p (X 7--2-4±o.4 J. — 7-200, fuUy consistcut with the NFW asymptotic slope. Two models 
were shown to provide an adequate fit to the data, a NFW profile with c — 4 ± 2, and a 
Burkert profile with a rather small core radius (< 0.1 7^200 )■ The M(r) solution obtained 
by using ETGs as isotropic tracers was later confirmed by using another tracer of the 
gravitational potential, i.e. cluster Sa-Sb galaxies |;226 . 

The mass distributions of a few individual nearby clusters (including Coma) have also 
been determined [1901 12051 1204j . In order to break the mass-anisotropy degeneracy not 
only the projected velocity dispersion profile but also the velocity kurtosis profile were 
derived and modeled. All cluster mass profiles turned out to be well described by an 
isotropic NFW model, with a median(c) = 7. 

The ENACS data-set was re-analyzed to determine the relative contributions of 
baryons and DM to the total mass profile |227| . Since the DM contribution is domi- 
nant, the resulting DM M{r) does not differ significantly from the total mass M{r), it is 
only slightly more concentrated (NFW c = 5± 1, Burkert Vc — 0.12 ±0.02; see Fig.lH]). If 
the subhalos DM contribution is subtracted from the whole DM component, what is left 
is the diffuse DM associated with the main halo (cluster) , which appears to be even more 
concentrated (NFW c = 7 ± 1, Burkert = 0.09 ± 0.02). Note however that splitting 
the DM M(r) into its halo and subhalo components is very model-dependent. 

These results show that central cuspy models such as NFW and Hernquist provide 
an adequate fit to the mass profile of nearby galaxy clusters. The Burkert profile is also 
acceptable, as far as the core radius is small, of order the size of the BCG or smaller. Since 
galaxies are treated as point-like tracers of the potential in the Jeans analysis, the size of 
the core radius is close to the resolution size of the analysis. The upper limit on the size of 
the core radius can be used to constrain the DM scattering cross section by comparison 
with simulations [170j . The resulting upper limit, < 2 cm^ g~^, effectively rules out 
Self-interacting DM as a possible way to explain the cored mass density profile of dwarf 
galaxies [1691 1228] . The absence of a significant core in the cluster mass distribution also 
implies that dynamical friction is not very effective in transferring cluster galaxy kinetic 
energy to DM particles [182] . This is consistent with observational estimates of galaxy 
luminosity segregation in clusters (see, e.g., [99]). 

While the inner slope of the density profile is essentially unconstrained, at large radii 
the asymptotic slope of the density profile is constrained to lie between —3 and —4, 



Galaxy systems in the optical and infrared 



23 



2.0 
1.5 

\ 1.0 



0.5 
0.0 

5 10 15 20 25 30 

c 

Fig. 7. - The velocity anisotropy Orjat vs. the NFW concentration c = r2oo/fs parameter as 
determined for a stacked sample of 26 X-ray-emitting small groups, mostly from the GEMS 
sample [139^ . l-a contours are shown [234) . 



consistent with the NFW, Hernquist, and Burkert models, but not with the SIS model. 

At higher redshifts the constraints are less strong. Results based on the analysis of 16 
stacked CNOC clusters confirm that NFW is an acceptable mass profile model on cluster 
scales also at 0.17 < z < 0.55 [TgTl 1^1X55] , 

The CDM-motivated NFW model fits well the mass profiles of cluster-sized halos, 
and does not fit the mass profiles of galaxy-sized halos. Hence it is important to test 
the model at intermediate scales (groups of galaxies) . Unfortunately, the results for the 
group M(r) are still controversial so far. From the analysis of 588 galaxies in 20 stacked 
groups the group density profile was found to be consistent with the Hernquist model 
[230j , but using a sample twice as large the same authors concluded that a single power- 
law model is a better representation of the data |231| . Both results are inconsistent with 
those obtained for a higher redshift group sample (0.1 < z < 0.55) whose average mass 
density profile is characterized by a very shallow-slope and a central core |42j . 

Discrepant results probably arise as a consequence of different selections of the group 
samples, since not all groups are dynamically virialized systems |1971 1232[ I200[ [230] . A 
proper characterization of the group mass profile awaits a careful definition of the group 
sample, based on the group characteristics. Such a sample may be provided by the 
groups of the Group Evolution Multiwavelength Study (GEMS '139') for which both X- 
ray and optical data are available. Comparing optical and X-ray group properties helps 
constraining the group dynamical status [129j . A preliminary analysis of this sample 
suggests consistency of the average group mass profile with the NFW model, with a 
higher concentration parameter than for clusters, in line with predictions from ACDM 
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Fig. 8. - The NFW concentration c = r2oo/''s parameter vs. the virial mass M(r2oo) as de- 
termined on four samples of groups and clusters [2341 12301 12061 1227| characterized by different 
average masses (points with 1-a error bars). The solid and dashed lines indicate the median 
and ±1-(T predictions for a ACDM cosmology with WMAP5 cosmological parameters [235] . 



(see Fig.H [23311234] ). 

In Fig. [8] results for the best-fit NFW concentration parameter are collected from 
the literature. The c — c{M) trend is somewhat above recent predictions from ACDM 
cosmological numerical simulations |235j . 

2'3. The relative distribution of dark and baryonic matter. - In order to answer the 
question raised in Sect. [U i.e. are galaxies distributed like the DM, the mass density 
profiles of galaxy systems must be compared with their galaxy number- or luminosity- 
density profiles. These can be evaluated by counting galaxies or, respectively, summing 
galaxy luminosities, in concentric annuli around the cluster center, taking into account 
the completeness correction, if needed. The mass-density to luminosity- (or number-) 
density profiles ratio is called the mass-to-light (M/L) profile. 

Since different cluster galaxy populations have different distributions (see Sect. 13 ip 
there is not a unique M/L profile. Depending on the photometric band used to select 
the cluster galaxies, the relative fraction of red and blue, quiescent and star-forming, 
early- and late-type galaxies in the resulting sample may vary. Modulo the selection 
in type or color, the M/L profiles found by different authors are generally in agreement 
pUSllWlfnil^ . The cluster M/L profile increases from the center to 0.2 raoo, flattens 
out to r200i and then decreases, by a factor x2 out to the turnaround radius. The 
trend near the center is caused by the presence of the BCG which sits at the bottom 
of the cluster potential well. The external, decreasing trend is instead caused by the 
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increasing fraction of late-type, blue galaxies with radius. Selecting only the red, early- 
type, quiescent cluster galaxies (or selecting galaxies in the K band) flattens the M/L 
profile in the outer parts. Removing the BCG flattens the M/L profile near the center. 
Hence, the light of red cluster galaxies (except the brightest one) does indeed trace the 
mass, but the light of blue cluster galaxies does not. Applying the virial theorem to the 
distribution of red cluster galaxies (BCG excluded) should then provide unbiased mass 
estimates for dynamically relaxed clusters [57]. 

While galaxies are useful tracers of the cluster potential, they are by far a negligible 
component not only of the cluster mass, but also of the baryonic mass. Most of the bary- 
onic mass is in the intra-cluster, X-ray emitting gas. The intra-cluster gas-to-total mass 
fraction increases with radius as r° '' |227j . Hence, the baryonic mass is less concentrated 
than the total mass at all radii, except near the very center, where the baryons of the 
BCG dominate the mass budget (see Fig. IH]). 

Exploring the evolution of the M/L profile with redshift can provide useful informa- 
tion on when and how the different galaxy populations settle in galaxy clusters. The poor 
constraints existing so far for clusters at z « 0.3 seem to confirm that M / L k, constant 
out to the virial radius, when only red galaxies are selected |189j . 
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Since the average mass profile of low-mass galaxy systems (groups) is not yet well con- 
strained, results on their M/L profile are controversial [2301 142j. Cosmological numerical 
simulations indicate that groups have more concentrated mass profiles than clusters (see 
e.g. [1721 1209"] ). while observations indicate that groups have less concentrated galaxy 
number density profiles than clusters |126j . hence groups might be characterized by 
steeper M/L profiles than clusters. 

2'4. The orbits of galaxies and mass accretion. - According to the hierarchical model 
for the formation and evolution of cosmic structures, clusters grow from accretion of 
galaxies and galaxy groups from the field. CDM cosmological numerical simulations have 
shown that DM particles accrete onto clusters on moderately radially elongated orbits, 
i.e. with a radial velocity anisotropy that increases moving out to the virial radius (e.g. 
[2361 12371 12191 I238j ). Observational evidence supporting the hierarchical build-up of 
galaxy clusters has been provided by the discovery that cluster ETGs and LTGs have 
different kinematics [1931 12391 12011 11511 1100] . LTGs are characterized by a larger (T„ than 
ETGs, and this has been interpreted as evidence that LTGs are an infalling, unvirialized 
population. However, kinematical evidence alone cannot prove the LTGs are indeed an 
infalling population, full dynamical modeling is required (see Sect. [2T]) . 

One of the first full dynamical modeling of a cluster was made in the early 80s for 
the Coma cluster [240) . It was concluded that the galaxy orbits are not primarily radial. 
Consistently, many recent dynamical modeling of low-z galaxy clusters, mostly based on 
stacked cluster samples from the ENACS, CNOC, and SDSS data-sets, have concluded 
for quasi-isotropic orbits of ETGs [2291 [2301 [2411 [1901 [203 • Since ETGs are the dominant 
cluster galaxy population, also the mean velocity anisotropy (see eq.[TT|) of cluster galaxies 
altogether is found to be /3 « [1891l216ll205j . Interestingly, full dynamical analysis does 
not support the interpretation of LTGs as an unvirialized infalling population. Probably 
the details of the results depend on how accurately are interlopers rejected from the 
sample of cluster members. Anyway, both for nearby and medium-z clusters LTGs are 
found to be in dynamical equilibrium within the cluster potential. At variance with 
ETGs, however, LTGs have moderately radially anisotropic orbits [2291 Hill [2261 
1242] . with an anisotropy that increases with radius [2261 12331 1243j (see Fig. [TU)) . A finer 
distinction of the LTG population into two classes, Sa-Sb on one side and Sbc-Irr on 
the other, has shown that the radial anisotropy is characteristic of the latter class only, 
while the orbits of Sa-Sb are isotropic within observational uncertainties [226] . 

The velocity anisotropy profile of LTGs is remarkably similar to that of DM particles 
in clusters extracted from cosmological numerical simulations (see, e.g., [2371 12191 1238j ). 
The orbital characteristics of DM particles are reminiscent of their almost radial accretion 
onto clusters along the surrounding filaments. By analogy, also the predominantly radial 
orbits of LTGs can be taken as an indication that these galaxies retain the dynamical 
memory of their infalling motions into the clusters. They are probably newcomers of the 
cluster environment, where they have spent too little time for the dynamical memory of 
their initial infall to be totally erased. ETGs, on the other hand, have had time to undergo 
sufficient energy and angular momentum mixing, capable of isotropizing their orbits 
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Fig. 10. - A preliminary determination ol the velocity anisotropy profiles of the red and blue 
galaxies in the CIRS clusters and their ±l-a confidence levels [243] . 

(see, e.g., |244| and references therein). Such energy and angular momentum mixing 
occurs in galaxy systems mostly via phase- and chaotic-mixing or violent relaxation 
[2451 12461 12471 12481 1249] , which occur when the system gravitational potential changes 
rapidly, i.e. at the time of the system assembly or on the occurrence of major mergers 
[2501 12511 1252] . Another process capable of isotropizing galaxy orbits is the secular 
growth of cluster mass [lOlj . 

LTGs have probably entered the cluster environment after the last major merger. 
Consistent with this hypothesis is the fact that they still retain most of their gas content, 
which cluster-related environmental processes will eventually strip given sufficient time 
(see Sect. I3'6)) . Numerical simulations confirm that recently accreted satellites in a host 
halo have more radially extended and less bound orbits [2531 1254j . 

Independent, direct evidence for the accretion of field spirals (S) onto clusters has been 
obtained from the analyses of the distance-velocity diagram around the Virgo clusters 
and other nearby galaxy systems [2551 12561 1257j . Unfortunately, distance measurements 
are affected by large uncertainties and cannot yet be used to assess the infall process in 
a statistically significant sample of massive clusters. 

There is no direct estimate of the evolution with z of the ETG and, separately, the 
LTG (3{r). However, an indirect argument can be used to rule out significant evolution 
at least up to z ~ 0.3. The projected phase-space distributions of ETGs and, separately, 
LTGs in the z ~ 0.1 ENAGS clusters and in the z ~ 0.3 GNOC clusters are remarkably 
similar [233] , except for the normalization (the fraction of blue galaxies is higher in more 
distant clusters, the so-called Butcher-Oemler effect - see Sect. 13 2p . Also the average 
M (r) of ENACS and CNOG clusters are very similar [207lfT89] . Similarity of the observed 
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projected phase-space distributions and of the mass profiles then imply, from the Jeans 
analysis (see Sect. I2'l|) . similarity of the velocity anisotropy profiles. This implies, in 
particular, that, just like their low-z counterparts, also the CNOC LTGs are newcomers 
of the cluster environment. Given that their fraction decreases with time, cluster LTGs 
must either transform in ETGs or dim with time (or both) . If they do undergo color and 
morphological transformation, they need at the same time undergo orbital isotropization 
since ETG orbits are isotropic and LTG orbits are moderately radial. This observation 
could prove useful in constraining the mechanisms that drive galaxy evolution in clusters 
(see Sect. [3^ . 

The mass accretion rate can be estimated by measuring the mass outside the virial 
cluster region which is bound to infall into the cluster in the future. Using the Caustic 
method it has been estimated that Af(r < r2oo) ~ -^(''200 < r < rt), where rt ~ 5r2oo 
is the turnaround radius, i.e. the characteristic radius that separates accretion from 
outflow regions [214] . According to ACDM cosmological N-body simulations, clusters 
will reach their final M200 rnass when they are 32 Gyr old |258j . The Caustic estimate 
then implies a future average accretion rate of M « 0.06 M2oo/Gyr. This estimate is 
in excellent agreement with an independent estimate obtained by summing up the mass 
of recently accreted groups in the Coma cluster [194j . M ~ 0.02 — 0.11M2oo/Gyr. The 
analysis of the density profiles of cluster galaxies in different redshift bins has allowed 
estimates of the stellar mass accretion rates [195] , Mi, ~ 0.06 Af*/Gyr at z = 0.45 and 
Mi, ~ 0.02 Mj,/Gyr at z = 0.20, implying a significant z-evolution of the mass accretion 
rate. These estimates depend on the timescale for halting star formation (SF in the 
following) in the accreting blue galaxies, assumed here to be 1 Gyr. A timescale of 
~ 0.5 Gyr would double these estimates, and make them more similar to those obtained 
by the other techniques mentioned above. 

2'5. Subclusters. - In Sect. I2'4I evidence has been provided that clusters evolve by 
accreting galaxies from the surrounding field. A large fraction of field galaxies occur 
in groups, hence in fact clusters not only accrete isolated galaxies but also groups of 
galaxies. When groups of galaxies enter the cluster environment they are subject to 
tidal forces that tend to disrupt them (see, e.g., |259[ 12601 1261) ). The time of disruption 
is longer for less massive groups |2601 1261] . that can resist for several Gyr [2601 1262j . 
Observationally, groups accreted by clusters and not yet totally disrupted will appear 
as secondary, statistically significant overdensities in the distribution of cluster galaxies. 
They are usually called "subclusters" or cluster "substructures" (see |263) for a review). 

In reality, not all observationally-identified subclusters are the surviving remnants of 
galaxy systems which have been accreted into a cluster gravitational potentials. Groups 
that are in the cluster foreground or background can also be identified as local galaxy 
overdensities, and hence confused with real subclusters. Among the foreground and 
background groups, those that are dynamically bound to the cluster will eventually be 
accreted and become real subclusters in the future [264] . 

It is important to identify and study subclusters. They provide information on the 
accretion process itself and ultimately serve as constraints to cosmological models (see. 
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Fig. 11. - The central region of the Coma cluster (1.13 x 0.65 Mpc^ at the cluster redshift, 
z = 0.0233) displaying a clear bimodal distribution of galaxies concentrated around the two 
BCGs (from http//www.mistisof tware . com/astronomy/Galaxies_ComaCluster .htm) . 



e.g., |265[ 1266] ). For instance, the total mass of subclusters in the Coma cluster region 
(see Fig. [TT]) has been used to provide an estimate of the Coma cluster accretion rate 
(see Sect. I2'4p . The distribution of subcluster luminosities has been compared to the 
distribution of subhalo masses thereby providing a test for theories of structure formation 
[267j . In the so-called 'bullet' cluster the distributions of the total and baryonic masses 
are displaced as the result of a very energetic collision of the cluster with an infalling 
group [268 . A measure of this displacement has been used to set an upper limit to the 
cross-section of DM particles [26911270] . 

Another aspect of the importance of subcluster studies is that internal cluster dy- 
namics can be affected by the very energetic (~ 10^^ J) cluster-subcluster collisions. 
Part of the energy and of the angular momentum of the collision is transferred to the 
cluster and subcluster galaxies. As a consequence, the collision affects the velocity distri- 
bution of cluster members, typically broadening, skewing, and/or flattening an initially 
Gaussian distribution ,2711 1272] , but also generating mean velocity gradients along the 
collision axis |273[|274] . Also the spatial distribution of galaxies is affected, and becomes 
less centrally concentrated |2721 [M] . As a consequence, cluster masses and velocity dis- 
persions are generally overestimated during and some time after the collision (see, e.g., 
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[275[ 12721 [97] ). Typically, masses are overestimated by ~ 30%, but depending on many 
parameters (e.g. the relative angle of the collision and line-of-sight axes, the mass ratio of 
the subcluster and main cluster, the time after the collision, the observational sampling, 
etc.) the mass can be overestimated by up to a factor ~ 5 or even itnrferestimated by a 
factor ~ 3 in extreme cases |272| [97] . 

Cluster-subcluster collisions can also have important effects on galaxy properties. 
They produce rapid variations in the cluster gravitational field that can stimulate non- 
axisymmetric perturbations in the galaxies involved in the collisions, and increase the 
rate of galaxy-galaxy interactions, leading to bursts of SF |276l 1277] 12781 \TI^ . Moreover, 
collisions are likely to displace the central BCG from the bottom of the cluster potential 
well, thus effectively halting the accretion process of satellite galaxies onto the BCG (see, 

e.g., [IHOIIMIIIMI). 

Several methods have been developed to identify subclusters in the optical (see, e.g., 
[272] and [263] ). Depending on the data-set, the different methods are more or less 
effective. Generally speaking, all these methods look for deviations from symmetry in 
the spatial and/or velocity distribution of cluster galaxies, or for significant secondary 
peaks in the surface density or projected phase-space distributions. 

The most widely used of these techniques has been developed in the late 80s |282] . 
In its original formulation, the method consists in considering all possible subgroups of 
10 neighbors around each cluster galaxy. The mean velocities and velocity dispersions 
of all these subgroups are calculated, as well as their differences with respect to the 
corresponding global cluster quantities. The sum of the squares of these differences 
constitutes the A statistics. 



where ay and v are the velocity dispersion and mean velocity of the whole cluster, ai and 
Vi are the corresponding quantities for any group i of 11 galaxies, and the sum is over 
all N cluster galaxies. Montecarlo simulations are then run to establish the statistical 
significance of A. 

After its original formulation, this method has been modified and adapted by several 
authors |283ll284l [99] . This was done in particular for extending the scope of the method, 
initially meant to estimate the probability that a cluster contains subclusters, and later 
adapted to find which galaxies have the highest likelihoods of residing in subclusters. 

Among clusters analyzed with the A technique, ~ 1/3 show significant evidence for 
subclustering, both at low and medium-to-high redshifts (see, e.g., |100[|285j ). A similar 
fraction has been found with other techniques (see, e.g., [2861 1115] ). but this fraction 
is likely to be a lower limit, since other, deeper analyses have discovered subclusters 
in clusters previously thought to contain none (see, e.g., [2841 1267] ). Presumably any 
cluster would show evidence for subclustering if examined in sufficient detail, because 
of the very nature of the process by which these objects form (hierarchical clustering). 
Today's analyses are aimed at determining not the fraction of clusters with subclusters. 
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but the mass distribution of subclusters |267j and its redshift evolution, and to compare 
it with the prediction of cosmological numerical simulations, in order to learn about the 
process of structure formation and evolution. 

Other constraints can come from comparing the observed distribution of subclusters 
in clusters with that predicted by cosmological models. Observational estimates of the 
number density profile of subcluster galaxies agree with estimates from ACDM cosmo- 
logical numerical simulations [2261 1261j . Moreover, the subcluster orbits are found to be 
tangential |226| , which is consistent with the idea that subclusters on radially elongated 
orbits are selectively destroyed by tidal effects |2531 1287] . 

2'6. Summary and perspectives . - The analyses of cluster dynamics based on galaxies 
as tracers of the gravitational potential have come to the following conclusions. 

• M{r) is consistent with the prediction of CDM cosmological numerical simulations. 

• A cored M{r) is not excluded, but the core, if exists, has to be small, of order the 
size of the central bright galaxy, essentially ruling out self-interacting DM as a way 
to explain galaxy rotation curves. 

• At large clustercentric radii, r > 2r2oo, the mass density profile slope is ~ — 3.5±0.5, 
still consistent with NFW, but somewhat steeper. 

• The red/early- type/passively-evolving cluster galaxies are characterized by nearly 
isotropic orbits, while the blue/late- type/star- forming cluster galaxies have increas- 
ingly radial anisotropy with increasing clustercentric radius. 

There is a lot that remains to be done and that will be made possible by exploiting 
already existing and forthcoming databases, such as the Imacs Cluster Building Sur- 



vejIdZj. The constraints that have been obtained so far can be put on a more solid 
statistical basis. E.g. it should be possible to rule out either the cuspy NFW or the 
cored Burkert with a ~ 5 times larger data-set than the ones used so far. The currently 
loose constraints on the relation between mass and concentration can be made tighter, 
in order to confirm or reject the apparent (albeit marginal) discrepancy with the theo- 
retical predictions |2881 1235] . While constraints on the shape of c = c(M) relation can 
only come by sampling the group mass scales, constraints on the normalization can also 
come by sampling the cluster mass scales alone, where kinematical methods are more 
powerful (because the number of available galaxies per cluster is larger). On the other 
hand, constraining the group M{r) is extremely important by itself, since group masses 
are intermediate between cluster masses, where cosmologically motivated M{r) models 
appear to work, and galaxy masses, where they do not. 

Individual cluster samples with > 500 galaxy velocities are currently rare. With fu- 
ture, larger samples it will become possible to constrain individual cluster concentrations 
and eventually compare the c-distribution of a complete cluster sample with theoretical 



i^"^) see http : //www . ociw.edu/research/ adressler/ 
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predictions, which indicate a skewed distribution (e.g. |289j ). Maybe it will even be 
possible to check whether it is indeed the concentration that changes from cluster to 
cluster, or whether it is the shape of the density profile [290[|291| . 

Models and simulations indicate that cluster mass profiles depend on their accre- 
tion history [IMl [Ml MM HHSl USS]- This could be tested by characterizing the mass 
profiles of galaxy clusters as a function of their degree of internal dynamical relaxation 
(or, inversely, of subclustering). Determining the detailed properties of subclusters in 
a complete cluster sample can provide another interesting test of cosmological models. 
Knowledge of the cluster and subcluster masses, and their relative velocities is required 

With a substantially larger data-base that the ones used so far, it will also be possible 
to determine orbital constraints for several classes of cluster galaxies, distinguished by 
color and internal structure. These are galaxy properties that are unlikely to evolve 
simultaneously (see Sect. 13 '2]) and orbital isotropization can be used as a clock for galaxy 
evolution in clusters. It is also important to determine the orbital characteristics of 
different cluster galaxy populations as a function of cluster mass and/or cluster structure, 
since orbital isotropization may occur at different epochs and proceed with different 
speeds depending on the global cluster properties. As a matter of fact, in some clusters 
ETGs and LTGs move with similar orbital anisotropies [242j . 

Numerical simulations and analytical models predict that the shapes of the mass- 
density and anisotropy profiles of cosmological halos are closely related |298l 12991 1300j . 
With current data-sets it is not possible to independently constrain cluster mass- and 
anisotropy-profiles to the accuracy level required to test this predictions, but with future, 
larger data-sets, it will. 

Investigating the redshift evolution of cluster mass- and anisotropy-profiles and of 
cluster mass accretion can provide interesting constraints on the hierarchical model of 
cluster formation and galaxy evolution in clusters. Cosmological numerical simulations 
predict two phases of CDM halo assembly. The initial gravitational collapse phase is 
responsible for establishing the central slope of the mass density profile, while the external 
slope is established by the later accretion phase [244[ I249j . It should then be possible 
to test this evolutionary scenario by determining the mass density profiles of clusters at 
different redshifts. 

As far as the mass accretion rate is concerned, this is predicted to increase with z 
by cosmological simulations (e.g. |301j ). in a way that depends on the cosmological 
parameters as and flm, and which can be observed as an increase of subclustering or 
of cluster ellipticity (see, e.g., [2651 1302] ). The evolution of cluster subclustering in 
different redshift bins can therefore be used as a cosmological probe. Features indicative 
of massive subclustering seem to be found more often in distant clusters than in nearby 
ones |2631 1303) . as expected. Although some results on the ellipticity evolution with 
redshift have been obtained from samples of nearby clusters [30411305] . a more systematic 
analysis of a well-controlled sample of clusters over a wide redshift range is still lacking. 
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Fig. 12. - The projected phase-space distributions of different galaxy populations in ENACS 
clusters. Top panel: Es and SOs; middle panel: early-type S; bottom panel: late-type S and 
Irr. Galaxy clustercentric distances have been scaled with the cluster virial radii, and galaxy 
velocities relative to their cluster mean velocities have been scaled with the cluster velocity 
dispersions, in order to stack 59 cluster samples together. Galaxies in subclusters have been 
removed from the sample. From [99j . 

3. Properties of cluster galaixy populations 

The most striking characteristics of the cluster galaxy population is its morphology 
mix. Nearby clusters contain a larger fraction of ETGs than of LTGs, exactly the opposite 
of what is observed in the general field. The morphological difference is so striking that it 
was noted even before the extragalactic nature of nebulae was established (see, e.g., jTj). 
Today, a modern version of the relation between galaxy properties and their environments 
is expressed by the segregation of different cluster galaxy populations in projected phase- 
space (see Fig. [T2]) . ETGs populate higher-density cluster regions than LTGs, which are 
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also characterized by a larger at all radii. The very bright ellipticals (Es) occupy 
denser and lower-velocity regions than any other cluster galaxy population. Within the 
LTG population, the projected phase-space distribution of early-type spirals (Sa~-Sb) is 
more similar to that of the ETGs than to that of late-type spirals and irregulars (Irr) . 
Finally, galaxies in substructures appear to be avoiding the cluster centers and to move 
at relatively low velocities. 

Why do galaxies care about their environment? Why different populations of galaxies 
occupy different regions of the phase-space? By analyzing the properties of galaxies as a 
function of their environment and of redshift it is possible to constrain the mechanisms of 
galaxy formation and evolution (see, e.g., |306| ). Hereafter, the observational properties 
of galaxies in and around clusters, as a function of their environment and redshift are 
presented. Physical processes capable of shaping galaxy properties in clusters are then 
reviewed. Finally, an evolutionary scenario based on the observational phenomenology 
is proposed. 

3T. Morphology. - The galaxy morphology mix changes with some regularity over 
many decades of projected galaxy number density, the fractions of Es and SOs increase 
with increasing local density and the fractions of S and Irr decrease |307[ I308| . This is 
the so-called "morphology-density relation" (MDR in the following). Given the strong 
anti-correlation between local density and clustercentric distance (radius), the MDR is 
often reported as a variation of the morphological fractions with radius (see, e.g., [309j ). 

A less strong correlation exists between the morphologies and the velocities of cluster 
galaxies, Es and SOs having a narrower velocity distribution than S and Irr ( [3101 [TMl 
12391 1311j ; see also [T] and references therein) . The MDR and the morphology-velocity 
relation together can be viewed as a segregation of different cluster galaxy populations 
in projected phase-space [5^ . 

The MDR is present at least up to z ~ 1 in massive clusters |3121 1313] . while in 
low-mass, irregular clusters at z ~ 0.5 there is no evidence of the MDR [314] . The MDR 
is anyway evolving also in massive clusters, as the ETG fraction {Jetg) decreases from 
z ~ to z ~ 1 [m [3151 [3161 [312 [313] (see Fig. The observed Jetg evolution 
does not concern Es, which seem to be in place in the cluster environment well before 
z > 1, but SOs, or rotationally supported spheroidals |317| . whose fraction increases with 
decreasing z at the expense of S and Irr in the highest density regions. The SO fraction 
is 0.46 ± 0.06 in local clusters, and decreases by a factor ^ 2 in z ~ 1 clusters |312| . In 
z ~ 1 clusters, SOs are ~ 0.5 Gyr younger than Es of similar luminosities [318j . Most 
of the SO fraction evolution occurs at z < 0.5 [3191 and concerns low- luminosity galaxies 
[320] . 

3'2. Colors. - Most cluster galaxies are distributed in a narrow band in a color vs. 
magnitude diagram, the so-called color-magnitude relation, CMR hereafter [3211 13221 
1323] . In nearby groups and clusters the CMR is defined all the way down to dwarf 
galaxies [Ml IBSillSlS]. The CMR is well defined also in high-z clusters [HZl [El [5T5] 
and even for z « 2 protoclusters, but not so much for z « 3 protoclusters '328'. The 
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Fig. 13. - The MDR for z ^ I clusters (circles with error bars from [312j . dashed line from |313) ) 
compared with the same relation in local clusters (solid line from [3U71 1314] , dotted line from 
[308]). From [312] , 



mean color of the CMR galaxies indicates their stellar populations were formed at z > 2.5 



pm [3301 [831] . 

The CMR is rather tight in nearby clusters, but its mean color and scatter increase 
with clustercentric distance and decreasing local density [3321 13331 13341 1335] and at the 
fainter end of the LF [ 336] . The increased scatter suggests a younger average stellar 
population for the CMR galaxies in lower density regions, with an age gradient of 2.5 
Gyr from the cluster center to its outskirts [334] . A smaller age gradient is also seen in 
high-z clusters |318j . Spectral analyses show the field ETGs to be ^ 1 Gyr older than 
their cluster counterparts [337] . 

The CMR faint-end becomes less populated in high-z clusters [3391 13401 13311 1341] 
and this evolution occurs at different epochs depending on the cluster richness [338] (see 
Fig. [T3|) . Hence the faint-end of the red cluster galaxy LF flattens [3421 1343] , and the 
dwarf-to-giant red-galaxy number-ratio decreases [3441 13451, 1346] . with increasing z. As 
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Fig. 14. - The ratio of luminous-to-faint CMR galaxies as a function of redshift, shown separately 
for richer (circles) and poorer (squares) clusters. A low-z reference is also shown (diamond). 
From [3381. 



the number of faint red cluster galaxies diminishes with z, the number of blue cluster 
galaxies increases [338j . Such variations of the CMR with magnitude and z are however 
not seen in all clusters [347[ 13481 1327j , and this indicates that there is a significant cosmic 
variance in the evolution of cluster galaxy properties [334] . 

The z-evolution of the CMR was already noticed 30 years ago by Butcher & Oemler 
[349[ 1350] . They measured the red-galaxy fractions in two z ~ 0.4 clusters, and found 
them to be significantly lower than in nearby clusters. This phenomenology has since 
been known as the Butcher-Oemler effect (BOe hereafter). The BOe can also be seen as 
an increase of the clustering strength of blue galaxies with increasing z [351j . The BOe 
was initially suspected of being caused by field galaxies contamination, but spectroscopic 
observations have later confirmed the physical reality of the effect. As the fraction of blue 
cluster galaxies increases with z, also the fraction of galaxies with spectra characteristics 
of a young stellar population (e.g. the E+A galaxie a(^^')r ) increases |353i I354[ 1355] . a 
phenomenology known as the spectroscopic BOe. IR observations have discovered a 



(^®) E-l-A galaxies are galaxies characterized by an elliptical- like spectrum with strong Balmer 
lines 13521. 
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population of IR-bright emitters in medium-z clusters, not seen in nearby clusters [356[ 
13571 13581 13591 l360l 13611 |362], a phenomenology known as the IR-BOe. The BOe is 
stronger for fainter galaxies [363[ I364j and in the more external parts of galaxy clusters 

Altogether, studies of the CMR and BOe indicate that the color evolution of cluster 
galaxies occurs later for fainter galaxies, in lower-density regions, and in less massive 
clusters. The analyses of the MDR find exactly the same trends for the morphological 
evolution of cluster galaxies. Since there is a clear correlation between galaxy morphology 
and color, the obvious question is whether the morphology and color environmental 
dependences describe in fact the same physical phenomenon. There is evidence that 
galaxies in low-z clusters do not show any MDR at fixed age (as deduced from their 
spectral energy distributions), suggesting the MDR is in fact an age-density relation 
[366]. Similarly, the MDR of galaxies in 0.4 < z < 0.8 clusters can be derived from 
the SF-density relation (which is strictly related to the age-density relation) by using 
the average SF per morphological class [367] . On the other hand, the radial increase of 
the CMR scatter has been shown to be too strong to be entirely accounted for by the 
MDR [333| . Hence the relation between galaxy colors and environment seems to be more 
fundamental than the MDR. 

Morphological and color evolution do not seem however to proceed at the same speed. 
Morphological evolution seems to take longer than color evolution. This is indicated by 
the analysis of the evolution of the galaxy mass function [368] . by the presence of passive, 
red cluster galaxies with spiral morphologies [369113551 1370] . and by the presence of early- 
type S which have the same age of SOs in the highest-density cluster regions |366j- 

3'3. Masses. - The masses of cluster ETGs can be inferred from the analysis of the 
ETG fundamental plane (FP), a relation between the ETG internal velocity dispersions, 
their effective radii, and their effective brightnesses |371[ 13721 1373] . By studying the FP 
as a function of z, and based on spectrophotometric models, it is possible to constrain the 
ETG M/L evolution, their average formation redshift (z/), and the evolutionary history 
of their stellar populations [374| . FP studies indicate Zf > 2 for the most massive cluster 
ETGs, but more recent AI/L evolution and younger ages for ETGs of lower-masses 
[371 [373 [371 [373. These studies also indicate that cluster ETGs are - 0.4 Gyr older 
than field ETGs [378j , and that this age-difference increases with galaxy mass [375j . The 
M/L of SOs and E+A galaxies are different from the M/L of Es [3791 1374] , and this 
indicates that the former have experienced more recent episodes of SF. 

Overall, the results obtained from the study of the FP support the conclusions reached 
from the studies of the CMR and MDR. 

3'4. Luminosities. ~ Many analyses have shown the LF of group and cluster galax- 
ies to be different from the LF of field galaxies. In particular, compared to the LF of 
field galaxies, the LF of cluster galaxies is characterized by a steeper faint-end slope 
[3501 13511 [53 [351 [351 MM [5SB: 364, 386, 3871 [351 [SHI [SSI [Ml MM > a brighter 
characteristic magnitude [394i i395i i396i i397j , and a dip or plateau at intermediate lu- 



38 



A. BlVIANO 



-22 -20 ^-18 -16 



300 - 




' — ' ' ' ' 

14 16 18 20 

V 



Fig. 15. - The LF of the Coma cluster of galaxies, showing the excess of bright galaxies relative 
to a Schechter function (dashed line), a steep faint-end upturn and a plateau at intermediate 
luminosities. The solid line is a Gaussian-f Schechter fit to the data. From 1831 . 



minosities [Ml ISHOl ISMl SOOl ISHH] (see Fig. According to other studies, how- 

ever, there is no significant difference between the LF of cluster and field galaxies 

Different conclusions about the shape of the average LF of cluster galaxies may be 
reached if different clusters are characterized by different LFs, or if different studies 
consider different cluster regions and the LF changes with radius. While the cluster LF 
appears to be universal (see, e.g., j388j ). there is evidence that its shape does change with 
radius, its faint-end slope being shallower (and hence similar to the field LF) in the central 
regions of clusters [53 [Ml SOI MM SOZl iOHl SOSl 1113 (Ml Hill [Mi ISSSl- It is also 
possible that the reported faint-end slopes of cluster LFs are overestimated because of 
incorrect field-count subtractions [4121 13241 1393j . or, vice versa, underestimated because 
of a biased selection against low surface-brightness galaxies |413| . 

The ratio between the number of bright (giant) and of faint (dwarf) galaxies increases 
towards the cluster center, an effect known as luminosity segregation [11 1414] . Very bright 
cluster galaxies not only prefer the central regions, but are also closer to the mean cluster 
velocity, their cr^ being only < 1/2 the global cluster cr^ |201[|311| . Luminosity segregation 
is also found outside clusters (e.g. [4151 1416] ). 

Two other characteristics distinguish the luminosity content of clusters from the field, 
the presence of BCGs and of the intra-cluster diffuse light (ICL in the following). BCGs 
are quite different from field ETGs of the same luminosities, in particular since their sizes 
are bigger [4171 1418] . Moreover, BCGs are intimately related to their hosting cluster. In 
fact they live close to the bottom of the cluster potential [419[ 1267] , their luminosities 
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Fig. 16. - The central region of a, z — 0.39 X-ray selected galaxy cluster. Three companion 
galaxies are seen close to the BCG. Contours show r-band surface brightness levels and emphasize 
the presence of the ICL. Given the positions and velocities of the four galaxies, two of the 
companions are estimated to merge with the BCG in ~ 0.1 Gyr. From [428]. 



are correlated with the cluster total luminosities and masses [420] . and their main axes 
are aligned along the main cluster axes [421] . 

There is evidence of some ongoing SF activity in many BCGs, probably activated by 
the cooling of the intra-cluster gas [4221 1423j . but overall the average color of the BCG 
stellar populations suggests they formed at z > 2 |424j , and the total /C-band luminosity 
of massive cluster BCGs evolves passively with z. On the other hand, BCGs in clusters 
of low masses do show substantial stellar mass evolution, with a factor 2-4 increase in 
mass since z ^ 1 |425[ I426j . This is consistent with the fact that the fractional cluster 
luminosity in the BCG is anti-correlated with the total cluster light [ 420| . as if BCGs in 
massive clusters completed their assembly before their counterparts in low-mass clusters. 

Support for the hypothesis that at least some cluster BCGs have yet to complete 
their assembly comes from the observation that the luminosity ratio between the BCG 
and the second-ranked cluster galaxy is smaller for the less massive and irregular clusters 
[4271 1267] . Direct evidence for ongoing build-up of BCGs in some z ~ 0.4 clusters and 
groups has also been provided f 428[ 1429] (see Fig. [T6)l. 

The amount of ICL in groups and clusters is not very well constrained. It is estimated 
to be ~ 5-25% of the total cluster light [4301 ESU HSl [43l |43l|435], and as much as 
50% near cluster centers |436) . The properties of the BCGs and the ICL are related, 
suggesting a direct, physical link. Their colors and elongation axes are in fact similar. 
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and clusters with brighter BCGs have higher ICL surface brightnesses [437) . 

3'5. Nuclear activity. ~ There is observational evidence that the fraction of active 
galactic nuclei {/agn) is higher in galaxy pairs j438j and in compact groups [439j . and 
lower in clusters j353j . relative to isolated field galaxies. However, other studies have 
found no environmental dependence of Jagn |4401 I441j . 

The striking difference between Jagn in compact groups and in clusters suggests it 
is not the density of the environment that matters for the onset of the AGN activity. In 
fact, galaxy number densities in compact groups and clusters are both very high. On 
the other hand, cluster a^s are typically ^ 3 times larger than those of compact groups. 
Hence, the onset of AGN activity may depend more on the relative galaxy velocities, than 
on galaxy density. A study of several hundred groups and clusters has indeed shown that 
Iagn is anti-correlated with the velocity dispersion of the host galaxy system [442] (see 
Fig. [HI). 

3'6. Physical processes. - Several physical processes are capable of affecting the prop- 
erties of cluster (and group) galaxies. They result from the interactions among galaxies, 
with the cluster gravitational potential, and with the intra-cluster medium j443| . The 
most important of these processes are the following: 

• dynamical friction |1461 1444j ; 



• galaxy-galaxy collisions [4451 1446] . leading to tidal effects and, eventually, mergers 
p7ll448] : 
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• tidal forces induced by the cluster gravitational potential, resulting in tidal trun- 
cation of the galactic halos [449 1 1450 1 1237]: 

• ram-pressure stripping of the galaxy gas [4511 1452] . 

Dynamical friction is the process that slows down a massive galaxy as it moves in a 
sea of dark matter particles. These particles are gravitationally focused in the wake of 
the galaxy itself. The galaxy therefore feels a braking force resulting from the excess 
mass density of the dark matter particles in the direction opposite to its motion. The 
characteristic timescale of this process is 



where Vg is the galaxy velocity, rUg its mass, and p is the dark matter density |146[ 1453] . 
Hence dynamical friction is more effective in higher density environments and for more 
massive galaxies and not very effective when the galaxy moves fast. 

Collisions and mergers drive substantial galaxy morphological modifications [448[|454] . 
and induce starburst episodes in the central galaxy regions [4551 14561 1457] , which then 
consume the available gas content. Gas can also be expelled from the galaxies as these 
collide and eventually merge [4481 1458] , since the total mass of the merger product is 
generally less than the sum of the progenitor masses. The characteristic timescale for a 
galaxy to experience a collision with another galaxy is 



where Vg is the relative velocity between the two galaxies, Vg the galaxy radius, and v 
is the galaxy number density [277] . Hence collisions happen more frequently in higher 
density environments and between larger galaxies. 

The impulsive tidal field produced by the collision produces collisional stripping |450j . 
This is most important for the material in the external regions of non-compact galaxies 
undergoing close encounters, since the tidal force is proportional to Vg/ (P , where d is the 
distance to the colliding galaxy. 

Galaxy-galaxy collisions may lead to mergers when the relative speed of the encounter 
is very low. The characteristic timescale for a merger to occur is 



(17) 



tdf oc 




(18) 



1 



tc oc 




(19) 




where is the cluster velocity dispersion and ag is the internal velocity dispersion of 
the galaxy [4591 1460] . Mergers are therefore very effective in low- velocity dispersion, 
high-density environments, such as groups and cluster outskirts (filaments). 
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Fig. 18. - This diagram indicates the clustercentric radius over which the hsted mechanisms are 
effective in shaping galaxy properties. Two clusters are considered, with M200 masses ~ 9 and 
~ 14 in units of lO^M© (C10024+17 and MS0451-03, respectively). Arrows indicate the cluster 
virial radii. The indicated tidal processes refer to interactions with the cluster potential, while 
the term "harassment" refers to the effect of galaxy-galaxy collisions. From [373] . 



A galaxy moving on a circular orbit in the core of a cluster with core radius Vc 
suffer tidal truncation of its external parts, beyond a radius |450j 



will 



(20) 



rt 



2ay 



Hence galaxies moving in massive and centrally concentrated clusters will be severely 
truncated, and their external (dark matter and gaseous) halos will become part of the 
intra-cluster matter as they come close to the cluster center [237] ■ Note however that 
galaxies sitting at the bottom of the cluster potential, i.e. BCGs, are not truncated by 
this mechanism since they feel symmetric external forces. Eq. (j20p may however not 
be applicable to field galaxies entering the cluster along filaments, since their orbits are 
mildly radially anisotropic, and not circular [2371 [226 . For these more general orbits, 
numerical simulations indicate that eq. (|20p is still valid, once rc/2 is replaced with the 
radius of the orbit pericenter [2371 1461j . 

The tidal forces induced by the cluster potential also lead to the compression of 
galactic gas and can therefore stimulate SF [4621 145 7j . 

A galaxy moving in a cluster will feel a pressure exerted by the intra-cluster diffuse 
gas onto the gas of its disk and the gas reservoir in its halo. The disk gas will be 
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stripped under the condition that the ram pressure of the intra-cluster gas exceeds the 
gravitational restoring force per unit area on the disk, 

(21) picvl > 27rGS,I]g,„ 

where pic is the intra-cluster gas density, and E^, and T,gas are the stellar and gaseous 
surface densities of the disk, respectively |4511I463] . In fact, eq. pT|) is strictly valid only 
if the baryons dominate the mass budget of the galaxy disk. As far as the gas in the 
galaxy halo is concerned, the stripping condition can be written as follows 

(22) pjcvl > a^^Y^^ 

where Rg is the projected galaxy radius in the direction transverse to the galaxy motion, 
Pgas is the 3-d galaxy gas-density profile and the a term, of order unity, depends on the 
precise shape of the gas and mass profiles of the galaxy [463j . From eqs. (PT|) and it 
is clear that it is easier to strip the gas from a lower-mass galaxy moving at higher speed 
in a denser intra-cluster medium. 

Ram-pressure, tidal truncation, coUisional stripping can all lead to starvation [4641 
1465] . Present-day spirals would use their disk gas content in a few Gyr at their current SF 
rates. If these SF rates must be sustained over a Hubble time, the disks must be refueled 
by gas reservoirs in galaxy halos. Starvation results from removing the gas reservoir. 

It is important to note that the effectiveness of the different processes depend on 
the clustercentric distances and on the cluster masses. E.g. merging will operate more 
effectively in the cluster outskirts and for lower-mass clusters, and ram-pressure stripping 
in the cluster inner regions and for higher-mass clusters (see Fig. [T8| . 

3'7. A scenario for the evolution of galaxies in clusters. - What makes the properties 
of cluster galaxies differ from those of field galaxies? Do galaxies change their colors, 
morphologies, luminosities, and even nuclear activities, as they move from regions of 
lower density to regions of higher density under the influence of gravity? Or are galaxy 
properties established once and for ever since their formation? The two scenarios confront 
each other, but are not necessarily mutually exclusive. 

Several characteristics of the cluster galaxy population argue in favor of the scenario 
in which galaxy properties are established ab initio. In particular, both the CMR and 
the MDR seem to be established already in z > 1 clusters, and the color and FP-inferred 
Af/L-evolution of bright cluster ETGs indicate simple passive evolution since Zf > 2. 
However, other observations argue for a modification of the galaxy properties with time. 
In particular, there is a deficit of both faint ETGs and SOs in high-z clusters, as indicated 
by studies of the CMR, the MDR and the LF. This deficit is more conspicuous for less 
massive and more irregular clusters. The analyses of the CMR, the FP, and the ETG 
spectra also show that the average age of the ETG stellar population increases with the 
local density. 
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Fig. 19. - The morphological mix of galaxies as a function of clustercentric radius in simulations 
(dashed line; filled and open squares correspond to bright and faint galaxies) as compared to 
real data (solid line, from [309] 1. The agreement is excellent but the definition of 'numerical 
SOs' has been widened to include 'numerical early-spirals', otherwise a deficit of SOs compared 
to the observational sample would have resulted. From [166] . 



A natural expectation of hierarchical cosmological models is biased galaxy formation 
(e.g. |466j ). galaxies in denser environments form earlier, and are more massive than 
those formed later, because they form with an initially larger mass and undergo more 
mergers during their lifetime. These models are able to approximately reproduce the 
environmental dependences of galaxy properties [4671 11661 14681 14691 1470j (see Fig. [T9)) . 
Mergers increase the masses of galaxies, destroy spiral features and disks, reduce angular 
momentum. They also trigger a starburst that consumes part of the available galaxy 
gas. Mergers are therefore effective in driving a LTG to ETG transition. 

A merger can also form and/or activate a central black hole, hence an AGN, perhaps 
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Fig. 20. - Spatial distribution of members of the A1763 galaxy cluster at z ~ 0.2. Starburst 
galaxies identified by their IR emission are indicated with filled dots. The triangle indicates 
an AGN. Circles of 1.5 Mpc are drawn around the central positions of A1763 and its neighbor 
A1770. Shaded regions highlight the kinematically-detected filaments, where most starburst 
galaxies are located [482]. 



with a certain delay after the starburst [471j . This is observationally supported by the 
fact that AGN host galaxies have colors and morphologies suggestive of a 1-4 Gyr old 
starburst associated with a merger event j472j , and by the fact that a merger model fits 
the relation between Jag n and (t^ of galaxy systems |442| . The resulting AGN feedback 
could quench any remaining SF activity |473[ I474j , more strongly so in more massive 
galaxies, characterized by larger black hole masses [475] . The AGN energy output into 
the intra-group medium can also explain the phenomenology of group X-ray scaling 
relations [476] . 

Mergers of small galaxies or galaxy subunits should be able to form red Es at the 
cluster centers at times when cluster cr^jS have not yet become too high. A merger origin 
of cluster Es is indeed suggested by their flat metallicity and color gradients [477[ I478j . 
Mergers become inefficient at later times in high density regions, when these become 
characterized by high values of ct« (see eq. [19]). But mergers can still operate in lower- 
density regions. Hence, low-density regions should be populated by galaxies forming 
at later times, and hence grown up to smaller masses, in agreement with the observed 
trends of galaxy age with density and radius inferred from the analyses of the CMR 
relation. At intermediate z, the IR-BOe is suggestive of mergers among galaxies located 
in medium-density environments but outside the highest-density cluster regions [479] . At 
lower z, the merger activity is restricted to galaxies outside the cluster virial regions, as 
indicated by the high fraction of starburst galaxies detected in the relatively high-density. 
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but low-cr„ cluster-feeding filaments |480[|481[|482| (see Fig. [20]). Most of these starburst 
galaxies have small stellar masses. 

Summing up, high-density regions are the birthplace of massive galaxies, which as- 
sembly most of their mass at early times, before mergers become inefficient. Mergers 
stimulate SF and AGNs that successively quench any further SF. In lower-density envi- 
ronments less massive galaxies are formed, and the formation process is delayed. SF hence 
stops earlier in more massive galaxies in higher-density environments. At least qualita- 
tively, this scenario is consistent with the observation that the critical mass above which 
SF is suppressed increases with z, and more rapidly in denser environments [4831 136S] . 
the so-called "downsizing" effect [484]. 

According to numerical simulations, mergers of cluster galaxies can also make BCGs 
[4851 1301] . Support for a merger origin of BCGs comes from their shallow surface- 
brightness profiles [486j . BCG assembly can continue until very late, and is only ^ 50% 
finished by z ~ 0.5 [301] . Direct observational evidence that BCGs continue their mass 
growth at z < 0.5 has recently been provided [428[|429] . The more recent mass evolution 
of BCGs in lower-mass clusters |425[ 1426] is just another manifestation of the downsizing 
process. Cluster cTyS would generally be too high for BCGs to continue growing at 
recent times, even in relatively low-mass clusters, but dynamical friction can decrease 
the relative velocities of the more massive cluster galaxies orbiting the cluster central 
regions. Velocity segregation of the most luminous cluster galaxies is indeed observed 
[201j and is also predicted by numerical simulations [166j . Dynamical friction and mergers 
of the brightest galaxies in the central regions of galaxy systems can also account for the 
intermediate-luminosity dip and the brighter characteristic luminosity of the cluster LF, 
relative to the field LF [4871 [4881 [489] . 

Galaxy mergers must also be important in the formation of the ICL. The ICL char- 
acteristics are clearly related to the characteristics of the BCG, suggesting their common 
origin. The product of galaxy mergers is unlikely to contain 100% of the mass of their 
progenitors, as part of the stars of the initial galaxies are scattered away during the 
galaxy-galaxy collision. If ~ 1/3 of the stars of the merging galaxies are scattered into 
the ICL component at each merger event, it is possible to reconcile current hierarchical 
cosmological models with the lack of evolution since z ^ 1 of the stellar mass function 
at the high-mass end f 490l 14911 1492| . Numerical simulations predict that if the ICL is 
built through galaxy-galaxy collisions its fractional contribution to the total cluster light 
should be almost independent on cluster mass [493] . a prediction consistent with some 
observational estimates |494| , and in contrast with others [495] . Anyway, direct evidence 
for the ICL creation through galaxy-galaxy collisions comes from observations of com- 
pact groups [496j . Dwarf galaxies can form in the tidal tails of galaxy-galaxy collisions 
[497| contributing to steepen the LF faint-end of galaxy clusters. These tidal dwarfs 
are then rapidly destroyed in high-density regions by collisions with other galaxies, thus 
explaining why a flatter LF characterizes cluster central regions. 

Mergers and dynamical friction are therefore important processes in the evolution of 
cluster and group galaxies. On the other hand, the abundance of SO galaxies in clusters 
and its late-evolution are more difficult to explain [467[ 11661 1468| . Low-luminosity SOs 
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could form from spirals through the fading of the spiral disks resulting from gas removal 
(starvation) |498[ 1373] . The production of SOs from spirals is suggested by the relative 
evolution of the LFs of blue and red galaxies [338j and by the presence of a cluster galaxy 
population intermediate between ETGs and LTGs, the population of passively-evolving 
galaxies with spiral arms and red disks [499|, I373j . These passive spirals are generally 
also characterized by a very low-HI gas content given their stellar mass or luminosity 
(see, e.g., [50011501) ). The formation of SOs via the fading of spiral disks is also suggested 
by the analyses of the TuUy-Fisher relation and the globular cluster fraction of cluster 
SOs |5021 15031 1504] , as well as by the smaller disk sizes of cluster spirals relative to field 
spirals [50511506] . 

The precise mechanism by which cluster spirals loose their gas can vary, both with 
the mass of the accreting cluster, and with the distance from the cluster center [5071 
1373] ■ In the central regions of massive clusters, ram-pressure stripping is expected to 
be the dominant mechanism [508] . but tidal shocks in the cluster potential can also 
be important if the cluster mass distribution is highly concentrated [509] . At larger 
radii, and in low-mass clusters and groups, galaxy-galaxy collisions and related tidal 
stripping should dominate [3061 15101 1511] . These collisions can produce central bursts 
of SF, accelerating the disk gas consumption. There is indeed conspicuous observational 
evidence that at least some cluster (or group) SOs have experienced recent episodes of SF 
[SnilinHlinillSSniinillSZKniiKSIll- if processes operating in the cluster outskirts are 
effective enough in removing the gas of inf ailing field spirals, processes operating near 
the cluster center may play little role in the evolution of cluster galaxies [3061 15171 1469j . 

The fading of spiral disks reduces galaxy luminosities by > 1 mag, hence it is impos- 
sible to form bright SOs by this mechanism. In order to produce bright SOs from spirals, 
the spiral bulge mass must increase. This can occur by a central burst of SF |517| induced 
by minor mergers |5181l5l9] and/or major tidal heating. The tidal heating can be caused 
by galaxy-galaxy collisions or by the gradient of the cluster gravitational potential |277j . 
When galaxies collide at high speed, no merger can occur and tidal damage is limited, 
hence bright cluster SOs can only form at high-z before the cluster cr„ becomes too large. 
At low-z, bright SOs can form in small, low-cr^ groups. As these groups are accreted by 
clusters, they then become part of the cluster galaxy population j4881 14431 1520] . Accreted 
groups are identified as subclusters, and an excess of SOs in subclusters in the central 
regions of nearby clusters has indeed been observed [H^ . 

Possible progenitors of the bright SOs are early-type spirals (Se). In fact, the bulge 
luminosities of Se are comparable to those of SOs |521j . Se and SOs occupy regions of 
similar (albeit somewhat lower) local densities |522| , the stellar populations of the Se and 
SOs that reside in high density environments have similar ages |366] , and Se and SOs move 
on similar, nearly- isotropic, orbits [226] . Other plausible progenitors of the bright SOs 
are the E-l-A galaxies. Spectrophotometric modeling indicates that E-l-A cluster galaxies 
should passively evolve into bright SOs [523] . E-l-A galaxies have positive color gradients 
[524j and have close companions and/or tidal features indicative of interactions |525j . 
all features consistent with a merger origin of these galaxies. Recently, a population of 
dusty starburst galaxies, bright enough to evolve into bright SOs after the SF episode has 
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ceased, has been identified in z ~ 0.5 clusters, using observations with the Spitzer satellite 
[360( 1361] , Many of these presumed SO-progenitors are found to reside in subclusters, 
where the galaxies can collide at relatively low speeds. 

Albeit very sketchy, the scenario outlined above tries to account for a large part (if not 
all) of the observational evidences of the environmental dependence of galaxy properties 
as a function of z. With a precise theory of galaxy formation and evolution yet to 
be formulated, and an infinity of data yet to be collected, our current (rather modest) 
understanding of galaxy evolution in clusters will certainly have to be revised in the near 
future. 

* * * 

These lectures were partly written during a three-months "Poste Rouge" stay at 
the Institut d'Astrophysics de Paris, whose kind hospitality I wish to acknowledge. I 
am grateful to Stefano Borgani, Andrea Lapi, Gary Mamon, and Piero Rosati for helpful 
comments and conversations. I also wish to thank Alfonso Cavaliere and Yoel Rephaeli for 
inviting me to give these lectures at the course on the "Astrophysics of Galaxy Clusters" 
of the International School of Physics "Enrico Fermi" . Finally, it is my pleasure to 
dedicate this work to my wife Patrizia, whose continuous support has proven fundamental 
to the success of it. 
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